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Preface 


While still a relatively new graduate student, I once remarked to my advi- 
sor, Jim Cushing, that I still didn’t understand quantum mechanics. To this 
he promptly replied: “You'll spend the rest of your life trying to understand 
quantum mechanics!” Despite countless books that the subject has spawned 
since it first assumed a coherent form in the 1920s, quantum mechanics 
remains notoriously, even legendarily, difficult. Some may believe students 
should be told that physics really isn’t that hard, presumably so as not to 
intimidate them. I disagree: what can be more demoralizing than struggling 
mightily with a subject, only to be told that it’s really not that difficult? 

Let me say it outright, then: quantum mechanics is hard. In writing 
this book, I have not found any “magic bullet” by which I can render 
the subject easily digestible. I have, however, tried to write a book that is 
neither a popularization nor a “standard” text; a book that takes a modern 
approach, rather than one grounded in pedagogical precedent; a book that 
focuses on elucidating the structure and meaning of quantum mechanics, 
leaving comprehensive treatments to the standard texts. 

Above all, I have tried to write with the student in mind. The pri- 
mary target audience is undergraduates about to take, or taking, their first 
quantum course. But my hope is that the book will also serve biologists, 
philosophers, engineers, and other thoughtful people—people who are fasci- 
nated by quantum physics, but find the popularizations too simplistic, and 
the textbooks too advanced and comprehensive—by providing a foothold 
on “real” quantum mechanics, as used by working scientists. 

Popularizations of quantum mechanics are intended not to expound 
the subject as used by working scientists, but rather to discuss “quantum 
weirdness,” such as Bell’s theorem and the measurement problem, in terms 
palatable to interested non-scientists. As such, the mathematical level of 
such books ranges from very low to essentially nonexistent. 

In contrast, the comprehensive texts used in advanced courses often 
make daunting conceptual and mathematical demands on the reader. 
Preparation for such courses typically consists of a modern physics course, 
but these tend to be rather conceptual. Modern physics texts generally 
take a semi-historical approach, discussing topics such as the Bohr atom 
and the Compton effect. Formalism is minimized and description empha- 
sized; the highly abstract mathematical and physical concepts of quantum 
mechanics remain largely untouched. There is thus a rather large gap to be 
bridged, and students in advanced courses may find that they must solve 


x Preface 


problems and learn new applications even while the framework of quantum 
mechanics remains unclear. 

Neither popularization nor standard text, this book is intended to serve 
in a variety of settings: as a primary text in a short course, a supple- 
mentary text in a standard course, a vehicle for independent study, or 
a reference work. Knowledge of elementary calculus and basic complex 
analysis should provide sufficient mathematical background (a condensed 
discussion of these topics appears in Appendix A). 

The book’s modernity is reflected in its overall style and tenor, but 
also in some broad themes, such as the early and extensive use of Dirac 
notation, and the fact that neither wavefunctions nor the time-independent 
Schrödinger equation are granted privileged status. Another such theme is 
the adoption of the “statistical interpretation,” a very useful and lucid way 
to understand how quantum mechanics works in actual practice. Because 
the statistical interpretation is really a broad framework rather than an 
interpretation per se, it is easily “imported” into other approaches as the 
student may find necessary. 

Notable by their absence from the book are many standard topics, such 
as perturbation theory, scattering, and the Hydrogen atom. This is in keep- 
ing with a central motivating idea: that to properly understand the many 
and varied applications of quantum mechanics, one must first properly 
understand its overall structure. This implies a focus on fundamentals, such 
as superposition and time evolution, with the result that they may then be 
developed in a more detailed and explanatory style than in advanced texts. 

Some authors seem to believe that if they provide a clear, elegant, terse 
explanation, one time, any remaining confusion is the student’s responsi- 
bility. I disagree. Having taught (and learned) physics for many years at 
many levels, I find that there are myriad ways to misunderstand the sub- 
ject, so I have tried to make this book especially explanatory and useful 
for the student. Common variations in terminology and notation are clar- 
ified (e.g., the terms quantum state, state vector, and wavefunction). And 
I discuss not only what is right, but what is wrong. For example, although 
position-space and momentum-space are standard topics, students often 
fail to realize that there is but one quantum state, which may be cast 
into various representations. Such potential stumbling blocks are explicitly 
pointed out and explained. 

The great majority of problems are, to my knowledge, new. Most are 
intended to help develop conceptual understanding. A vast array of addi- 
tional problems may be found in other quantum texts. The time-honored 
physics dictum—that one doesn’t understand the physics unless one can 
solve problems—bears repeating here. But so does its lesser-known cousin: 
just solving problems, without the capacity to lucidly discuss those prob- 
lems and the attendant concepts and ideas, may also indicate insufficient 
understanding. 


Preface xi 


In part because this book is intended to transcend the traditional 
physics audience, a few words about studying the subject are in order. 
Much of our intellectual heritage-from art and music to social, political, 
and historical thought—concerns our human experience of the world. By its 
very nature, physics does not, and it is now clear that at the fundamental 
level the physical world doesn’t conform to our preconceived ideas. The 
concepts of physics, particularly quantum mechanics, can be exceedingly 
abstract, their connections to our everyday experiences tenuous at best. 

Because of this physical abstraction, and the requisite mathematical 
sophistication, understanding can be hard to achieve in quantum mechan- 
ics. Nevertheless, I believe that understanding (not memorization) must be 
the goal. To reach it, however, you may need to read more carefully, and 
think more carefully, than ever before. This is an acquired skill! For most 
humans it simply isn’t natural to exert the degree of concentration that 
physics demands—you didn’t think quantum mechanics would be easy, did 
you? The payoff for this hard work, to borrow Victor Weisskopf’s phrase, 
is the joy of insight. 

Essential Quantum Mechanics would not have become a reality absent 
the freedom and support granted me by Northern Arizona University. 
This includes a sabbatical spent, in part, developing the book at Loyola 
University Chicago. Professor Ralph Baierlein generously and critically 
read the manuscript and, as always, provided much wise and deeply 
appreciated counsel. Professor Peter Kosso offered useful comments and 
early encouragement. Sonke Adlung, of Oxford University Press, displayed 
abundant patience, kindness, and professionalism in helping me through 
the publishing process. Oxford’s Chloe Plummer endured my repeated 
underestimates of the time required to correct the manuscript. 

The influence of my late, great, Ph.D. advisor, Jim Cushing—whose life 
put the lie to the notion that scientists are not real intellectuals—permeates 
this book. My wife Katherine has been, and remains, a source of encourage- 
ment and forbearance through thick and thin. She also provided motivation, 
often by asking: When are you going to finish that #&!* book? Finally, I 
must thank my parents. Neither will see this book in print, yet both have 
indelibly impacted my life, and continue to do so, regardless of my age. 

After more than a few years on the planet, it sometimes seems to me 
that there is but one great lesson to be learned. That is that the real worth 
of a life is in contributing to the welfare of others. It is my hope that, in 
some sense, and in some measure, I have done so with this book. 


Flagstaff, Arizona Gary E. Bowman 
May 2007 


The true value of a human being is determined primarily by the mea- 
sure and the sense in which he has attained liberation from the self. 


Albert Einstein (1931) 
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Introduction: 
Three Worlds 


The best things can’t be told: the second best are misunderstood. 
Heinrich Zimmer! 


You may hear quantum mechanics described as “the physics of the very 
small,” or “the physics of atoms, molecules, and subatomic particles,” or 
“our most fundamental physical theory.” But such broad, descriptive state- 
ments reveal nothing of the structure of quantum mechanics. The broad 
goal of this book is to reveal that structure, and the concepts upon which 
it is built, without becoming engulfed in calculations and applications. To 
give us something concrete to hold onto as we venture into the wilder- 
ness before us, and to give us a taste of what lies ahead, let’s first take a 
little trip. 


1.1 Worlds 1 and 2 


Imagine a world; let’s call it World 1. In World 1, everything is made up of 
very small, irreducible units called particles. (Large objects are composed of 
collections of these small units.) Because particles are the fundamental stuff 
of World 1, all physical events there are ultimately describable in terms of 
particle behavior—specifically, in terms of particle trajectories, the motion 
of particles in space as a function of time. Thus, to understand and predict 
events in World 1 we must understand and be able to predict the behavior 
of particles. 

Our observations in World 1, then, are fundamentally observations 
of particle trajectories. Any association of physical properties with the 
particles, beyond their trajectories, is secondary—done to facilitate our 
understanding and predictive abilities. Nevertheless, it’s convenient to pos- 
tulate various measurable physical properties associated with the particles, 
and to give these properties names, such as mass and charge. (The defini- 
tion and measurement of such properties may be a daunting task, but that 
is not our concern here.) 


1 Quoted in Campbell (1985), p. 21. 
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If these postulated physical properties are to be useful for understand- 
ing and predicting particle trajectories, we must construct a connection 
between the properties and the trajectories. This connection consists of 
two parts. First, we propose that the properties give rise to forces. In gen- 
eral, the connection between properties and the forces that they give rise 
to depends both upon the specific properties involved and upon the sys- 
tem’s configuration—the positions and/or velocities of the particles. The 
forces are then connected to the particle trajectories by a set of dynamical 
laws. These dynamical laws, unlike the laws that give the forces themselves, 
are perfectly general: they connect any force, regardless of source, to the 
particle trajectories. 

It is the job of the physicists on World 1 to define the physical properties 
of the particles, the way in which forces arise from these properties, and the 
dynamical laws which connect the forces to particle trajectories. And they 
must do so such that they obtain a consistent theoretical explanation for 
the particle trajectories—the fundamental observable entities of World 1. 

The worldview of World 1 is, of course, that of Newtonian classical 
mechanics. In World 1, the complete description of a system consists of a 
description of the motions in time, that is, the trajectories, of all particles 
in the system. To obtain such a description, we determine the forces arising 
from the particles in the system by virtue of their various associated prop- 
erties and the system configuration. Then, using very general dynamical 
laws—in classical mechanics, these are Newton’s laws of motion—we con- 
nect forces with particle trajectories. Note that in World 1, as in Newtonian 
mechanics, no explanation is given of how forces are transmitted from one 
particle to another. 

Now imagine another world: World 2. As in World 1, the tangible things 
of World 2 are made up of particles, and our goal is to determine and 
predict the trajectories of those particles. Now, however, the forces are 
transmitted from one particle to another by means of intangible fields which 
extend through space. In addition, dynamical properties, such as energy 
and momentum, are associated not only with the particles, but with the 
fields themselves. 

The inhabitants of World 2 have found a simple mathematical algorithm 
such that if they know the fields, they can calculate the forces. Thus, from 
the particles and their associated properties they can find the fields, from 
the fields they can find the forces, and from the forces they can find the 
trajectories. 

World 2 is the world of classical field physics. Here our starting point 
is the field (or equivalently, some potential from which the field is easily 
derived) created by some configuration of particles with their associated 
properties. 

Note that in both World 1 and World 2 what we really observe are 
the particle trajectories. We never really “see” a field or a force, or even 
mass or charge—we only see their consequences, their effects on particle 
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trajectories. Indeed, what we did was propose concepts such as mass, 
charge, and force as a means of providing a consistent theoretical base for 
our observations of particle trajectories, that is, in response to the observed 
trajectories. 


1.2 World 3 


We have one more stop: World 3. First, though, let’s note that although 
the physicists on Worlds 1 and 2 may disagree on the rigorous definition of 
the word “particle,” they would agree that we will rarely get into trouble 
if we think of a particle as a highly localized, even point-like, collection of 
physical properties. 

In comparison to Worlds 1 and 2, World 3 is a very strange place indeed. 
The physicists on World 3 do not talk of particle trajectories. In fact, even 
though they constantly refer to particles, their conception of a particle 
seems vague at best, perhaps even muddled. Certain particle properties, 
such as mass and charge, are well defined in World 3, as in Worlds 1 and 2. 
But dynamical particle properties such as energy, momentum, and posi- 
tion, are quite poorly defined: their values seem to depend not only on the 
particle itself, but also on the sort of measurement we perform on it. 

The key difference between Worlds 1 and 2 on the one hand and World 
3 on the other, however, is the following. In World 2 the particle proper- 
ties and the configuration of a system were used to determine fields, and 
from the fields, forces. (World 1 was similar except that the forces were 
determined directly.) The significance of the forces was their connection, 
by a dynamical law, to the time-dependent motion of the particles in the 
system. 

In World 3, also, particle properties and the configuration of the sys- 
tem provide our starting point: together they give rise to potentials, as in 
World 2. Now, however, we do not make the transition to forces. Instead 
the particle properties and system configuration give rise to a probability 
function. This probability function only allows us to calculate the “chance” 
of finding a particle with a certain value for a dynamical property upon 
measurement. In Worlds 1 and 2 the dynamical law allowed us to calculate 
exactly the time-dependent motion of particles. In World 3 we also have a 
dynamical law; however, it only allows us to calculate the development in 
time of the probability function. 

Although we still refer to particles in World 3, we may get into great 
difficulty if we imagine this to mean point-like units with well-defined 
trajectories. The “particles” of World 3 have poorly defined properties 
which only become well-defined upon measurement. Thus, we should not 
think of particle trajectories; that is, we should not think of a well-defined 
motion of the particles in space as a function of time. Only the probabil- 
ity function, which yields the statistics for obtaining certain values for the 
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dynamical properties of the particles upon measurement, has a well-defined 
development in time. 

The worldview of World 3 is essentially that of the most common inter- 
pretation of quantum mechanics. The contrast between Worlds 1 and 2 
on the one hand and World 3 on the other illustrates, in large measure, 
the fundamental disparity between the classical and quantum worlds. This 
disparity arises at the deepest level. Particles and trajectories, the funda- 
mental constituents of physical reality in the classical world, evidently are 
not well defined—if, indeed, they exist at all—in the quantum world. 


1.3 Problems 


1. An intelligent, thoughtful friend with essentially no mathematics or 
physics background reads this chapter. 

“In World 3,” he says, “the probability function evidently plays a 
fundamental role. But I don’t understand what a probability function 
is. When I asked a mathematician friend to explain, she just smiled 
and said “Oh, a probability function is just a probability distribution.” 
But I don’t know what that means, either.” 

Please explain, in non-technical terms that your friend can under- 
stand, what a probability function, or distribution, is. 

2. Consider World 3. In this world, what cannot be known precisely 
about a “particle”? Is there anything whose time evolution can be 
known precisely in World 3? If so, what is it? 

3. This question and the next help emphasize the contrast between clas- 
sical and quantum physics, by illustrating the overall structure of 
classical mechanics and classical field physics. 

The potential energy of a single, point particle of mass m is given by 
U= ska, where k is a constant. We wish to find x(t), the particle’s 
motion as a function of time. 

(i) Clearly describe, step-by-step, in words, how you will find z(t). 
(ii) Carry out the instructions you provided in part i. 

(iii) Now assume that at time t = 0 the particle is stationary at x = a. 
Find the specific x(t) for this case. 

(iv) Does any information about our point particle’s motion remain 
unknown, or do we now know all of it? 

(v) What is the common name used to describe this sort of system? 

4. Assume that the particle in the preceding question has charge q, and 
that its potential energy U arises from the presence of an electric field. 
(i) Find the particle’s potential, V. 

(ii) What is the electric field in which the particle resides? 


2 
The Quantum Postulates 


Quantum mechanics [is] that mysterious, confusing discipline, which none of us 
really understands but which we know how to use. It works perfectly, as far as we 
can tell, in describing physical reality, but it is a “counter-intuitive discipline,” 
as the social scientists would say. Quantum mechanics is not a theory, but rather 


a framework within which we believe any correct theory must fit. 
Murray Gell-Mann 


Quantum mechanics is often developed from postulates. I’ll also fol- 
low this approach; it has the distinct advantage that the entire theoretical 
foundation of the theory can be written down in a few concise statements. 

The postulate approach is subject to the criticism that it tends to 
emphasize the distinct natures of classical mechanics and quantum mechan- 
ics, and thus largely fails to illuminate the many similarities of the two 
theories. I will try to alleviate this deficiency somewhat in our discussion. 

What might often confound the physics student are apparently dissimi- 
lar statements made regarding some topic in various references. Frequently 
it is only after much study and thought that one finally sees that the vari- 
ous statements are in fact equivalent. The quantum postulates are a good 
example of just such a case. Most quantum mechanics texts contain the 
postulates, but they may not explicitly call them postulates. In addition, 
although the content of the postulates is almost universal, there often seems 
to be almost as many ways to state them as there are authors of quantum 
mechanics texts. 

Before we begin our examination of the postulates, it is worthwhile to 
contemplate what a postulate is. Consider Newton’s laws of motion. An 
astonishing amount of physics has been developed using these three sim- 
ple statements as their foundation. Linear dynamics, rotational dynamics, 
Lagrangian and Hamiltonian mechanics, gravitation, classical statisti- 
cal mechanics, electrodynamics—all of these are grounded in Newton’s 
laws. In a sense these are all, to some degree, derived from Newton’s 
laws. Arguably, even quantum mechanics itself is ultimately grounded in 
Newtonian mechanics.” 


1 M. Gell-Mann, in Mulvey (1981). 
2 Dirac (1978), p. 2. 
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So, what are Newton’s laws derivable from? The answer is: Newton’s 
laws cannot be derived from anything. Any truly fundamental “law of 
nature” must be underivable. That’s what makes it a law of nature. Such a 
law tells us something about nature “in the raw”; there is no way to derive 
it from either physical or mathematical considerations. Of course, we may 
find later that what we had previously thought was truly fundamental really 
isn’t. But no matter: in that case, we simply find that underivability, that 
is, fundamentality, lies at a deeper level than we previously thought. (Some 
may claim that Newton’s laws, for example, are not truly fundamental, but 
are derivable from quantum mechanics. If this view is correct then underiv- 
ability lies at the quantum mechanical level, not at the classical mechanical 
level.) 

A postulate is, essentially, a statement made without proof or ana- 
lytical justification—an underived statement. In physics a postulate is, 
in effect, a proposal, and such a proposal must ultimately stand or fall 
not on derivation, but on verification in the physical world. It must be 
tested experimentally. Newton’s laws of motion were such proposals. Quan- 
tum mechanics also makes such proposals: the quantum postulates. We 
will spend much of this book investigating these postulates and their 
consequences. 


2.1 Postulate 1: The Quantum State 


Information about a quantum-mechanical system is contained in 
the quantum state. The form and development in time of the 
state are determined by the time-dependent Schrödinger equation 
(TDSE): 


ih— = HW (2.1) 
where W is the quantum state and Å is the Hamiltonian operator. 


We have some distance to go before we can fully grasp the meaning 
of Postulate 1. Even now, however, we may utilize it to begin to develop 
one of the most deceptively subtle, and disputed, concepts in quantum 
mechanics: the quantum state.? Elsewhere, you will often find quantum 
states referred to as wavefunctions or state vectors. For now, we will use 
these terms sparingly. The term quantum state is both more general and 
more descriptive, and it reminds us that the quantum state is the analog 
of the state of a classical system. 

Let’s first consider the state of a system at some particular time in 
Newtonian mechanics. In this case the state of the system is given by the 
positions and velocities of all its constituent parts. From this information, 
and from the forces which determine the particles’ subsequent behavior, we 


3 Ballentine (1998), Section 2.1. 
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may (in principle) use Newton’s laws to determine the state of the system 
at any later time. The concept of state for a classical system is thus almost 
intuitively clear. 

Now what does Postulate 1 tell us about the quantum state? As 
stated, HH is the Hamiltonian operator. In quantum mechanics, operators 
are usually denoted by a “hat”. We will develop operators in more depth 
later, but for now it is sufficient to state that H is not just a number, 
or even a function. Rather it involves mathematical operations, such as 
differentiation. 

From Eq. (2.1), the TDSE clearly is not just an algebraic equation, 
but a differential equation.4 The solution to a differential equation is a 
function—in general, a complex function. Consider, for example, the (very 
simple) differential equation: 


& f(x) 


dx? 


+k? f(x) =0, (2.2) 


where k is a real constant and f(x) is the function which is a solution of 
the differential equation. (That is, f(x) is what we must find to solve the 
differential equation.) It is easily verified that the complex function, 


f(x) = eè? = cos(kx) + isin(kx), (2.3) 


is a solution of Eq. (2.2). In Postulate 1, the solution to the TDSE is Y. 
Evidently W is a function of time. In general, Y is complex, and will remain 
so as time evolves (progresses). 

Moreover, quantum states are not simply specifications of the corre- 
sponding classical quantities. Consider a single classical particle. The state 
of a classical particle at some time is given by the three position coor- 
dinates and the three velocity components of the particle. It turns out, 
however, that the quantum state of a single particle at a particular time 
is, in general, a function in space. This means that, in contrast to the clas- 
sical state, we can’t simply specify unique coordinates and velocities of a 
quantum particle. This is a mathematical manifestation of the difference 
between Worlds 1 and 2, on the one hand, and World 3 on the other (as 
described in Chapter 1). 

Finally, note that Postulate 1 states that “information about a 
quantum-mechanical system is contained in the quantum state.” But could 
information also reside somewhere other than in the state? Elsewhere you 
may read statements such as “all information resides in the quantum state,” 
or “the most complete information resides in the quantum state.” In this 
view, the quantum state provides a complete description of a quantum- 
mechanical system. Postulate 1, however, leaves open the possibility that 


4 Often AH involves partial differentiation; then the TDSE is a partial differential 
equation. 
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the quantum state may not be the whole story—that there could be some 
deeper layer of physics beyond quantum mechanics. These two views thus 
entail profoundly different philosophical perspectives. Indeed, the debates 
in foundations of quantum mechanics, still ongoing after 80 years, largely 
spring from the difference in these perspectives. We will return to the issue 
of the completeness of quantum mechanics in Chapter 3. 

Although Postulate 1 is absolutely essential for the construction of 
quantum mechanics, it is virtually devoid of physical content. It tells us 
something about the quantum state and how to find it, but nothing about 
how to connect the quantum state to physically observable predictions. We 
begin building that connection in the next section. 


2.2 Postulate 2: Observables, Operators, and 
Eigenstates 


Properties of a quantum-mechanical system that can (in prin- 
ciple) be observed, or measured, are called observables, and are 
represented by Hermitian operators. Suppose the quantum state 
W,; satisfies: 


AW; = ajV;, (2.4) 


where A is an operator representing an observable, and aj is a 
constant. Then Y; and a; are the jth eigenstate and eigenvalue, 
respectively, of A. If the system is in the state W,, then the result 
of a measurement of the observable represented by the operator 
A must be the eigenvalue aj. 


Postulate 2 tells us that measurable properties are called observables in 
quantum mechanics, and that, in the mathematical formulation of the the- 
ory, observables are represented by Hermitian operators. But what do these 
statements mean? To start to answer that, let’s first briefly revisit classical 
mechanics. If we consider a single particle system, we see that once we know 
the trajectory of the particle, z(t), that is, its position as a function of time, 
we can calculate the other physical properties that we’re usually interested 
in.° For example, the momentum, j = mv, is found by multiplying the par- 
ticle’s mass by its velocity, where the velocity is found by differentiating 
the trajectory z(t). A similar procedure can be used to find, for example, 
the angular momentum about some point. In other words, once we have 
calculated the particle’s trajectory we may simply apply the definitions of 
various physical properties to calculate the values of such properties. 

In quantum mechanics, however, we do not have well-defined particle 
properties to work with. The fundamental entity available to us is not the 


5 Here # denotes the position vector (x,y,z) or, in another common notation, 
(x1, £2, £3). 
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particle trajectory, but the quantum state—a function that is, in general, 
“spread out,” or distributed, in space, and from which we obtain probabili- 
ties. We certainly cannot simply apply, say, the definition of linear momen- 
tum, p= mv, to the quantum state and hope to get out the linear momen- 
tum of the particle. If we tried to do so, we would immediately be faced with 
the question of what to calculate the velocity of ...the quantum state? 

Postulate 2 prescribes the method to resolve these questions. Observ- 
ables will be represented (mathematically) in the theory by Hermitian 
operators. We will develop later what a Hermitian operator is, along with 
other important properties of the operators commonly used in quantum 
mechanics. The important point now is that an operator is, in general, not 
just a scalar, a vector, or a function. It is a mathematical operation, such 
as differentiation with respect to spatial coordinates. 

Postulate 2 then tells us that if a measurement of the observable cor- 
responding to A—call it A—is performed on W,;, the jth eigenstate of A, 
the result will be the eigenvalue a;. Let’s unpack this statement a bit. T hé 
equation of Postulate 2, 


ÂY; = a;¥;, (2.5) 


is an eigenvalue equation. On the left side of Eq. (2.5) the operator A acts 
on the function U;. (Remember that A is a mathematical operation, not 
just a number or a function.®) The right side then tells us that this equals 
the eigenvalue, aj, multiplied by Y}. An eigenvalue is not an operator; it 
is simply a number. 

Now let me state the physical interpretation of Eq. (2.5). Consider a 
system in the quantum state U,;. Then if I measure A, the observable cor- 
responding to the operator A, the measured value will be the eigenvalue 
(number) aj. Let’s say I wish to measure linear momentum, which corre- 
sponds to the operator p. Suppose the system is in the state x, the k’th 
eigenstate of p. Then the relevant eigenvalue equation is 


Dok = Prox: (2.6) 


Our measurement result will be the number pp, an eigenvalue of p. I don’t 
want to delve deeper into eigenvalues and eigenstates at this point. I simply 
want you to begin developing some “feel” for how eigenvalues, eigenstates, 
and eigenvalue equations enter quantum mechanics. 

Please realize that the eigenstates of an operator and the state of a 
physical system are distinct concepts. In discussing both Eqs. (2.5) and 
(2.6), as in Postulate 2, I stipulated that the state of the system was also an 
eigenstate of the relevant operator. However, for any quantum-mechanical 
observable there are infinitely many quantum states that are legitimate 


6 If, for example A is a differential operator, then Eq. (2.5) is a differential equation. 
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states of the system, but are not eigenstates of the relevant operator. For 
such state/operator combinations, eigenvalue equations such as Eqs. (2.5) 
and (2.6) do not apply. 

For example, if ~ is not an eigenstate of A, no number ĝ exists such 
that Aw = By is satisfied. An eigenvalue equation for some operator, say A, 
holds only if the function (state) on which A acts is an eigenstate of A. A 
very important result, then, is that an operator, a mathematical operation, 
may be replaced by the corresponding eigenvalue, a number, but only if the 
state on which the operator acts is an eigenstate of the operator. 

What is the physical import of eigenstates? As we shall see, quantum 
states yield probabilities for measurements of observables. But Postulate 2 
tells us that the probability distribution for an observable is trivial if the 
quantum state is an eigenstate of the corresponding operator. For the state 
W,, for example, the probability distribution for measurement of A is trivial: 
the probability of obtaining a; upon measurement is 1, and the probability 
of obtaining any other value is 0. We will, with certainty, obtain aj. For 
states that are not eigenstates of A, however, the probability distribution 
will be non-trivial, and that is what our last postulate deals with.” 


2.3 Postulate 3: Quantum Superpositions 


If a measurement of the observable corresponding to the operator 
A is made on the normalized quantum state w, given by, 


p= > ntn, (2.7) 


where the W,,’s are eigenstates of A and the cn’s are expan- 
sion coefficients, then aj, the eigenvalue of Yj, will be obtained 
with probability lgl’. The system will be left in the state Y; 
immediately after the measurement. 


Here |c;|? is the complex square of cj, that is, |cj|? = cic; (see Appendix A). 

In a sense, Postulate 3 simply extends Postulate 2 to cases where the 
state of the system is not an eigenstate of the operator corresponding to 
the observable being measured. Nevertheless, the physical implications of 
Postulate 3 have probably given rise to more controversy than any other 
aspect of the foundations of quantum mechanics. Our discussion of Postu- 
late 3 is not intended to be rigorous, but rather to be sufficiently descriptive 
that you may begin to form a mental picture of the structure of quantum 
mechanics. 

Before developing Postulate 3 proper, we define a normalized quantum 
state. Normalization simply accounts for the physical requirement that the 
probabilities for all possible results of some type of measurement must add 


T The concept of probability is discussed in detail in Chapter 3. 
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up to one. In the context of Postulate 3, normalization amounts to imposing 


the condition: 
5 len|? =1. (2.8) 


If a state is not normalized, we impose normalization by multiplying all 
coefficients by a constant such that Eq. (2.8) is obeyed. 


2.3.1 Discrete Eigenvalues 


Suppose a system is in the quantum state Q, and I wish to calculate the 
results I would obtain if I measured either of two observables, A or B, cor- 
responding to the operators Aand B, respectively. Specify that eigenstates 
of A are not eigenstates of B, and vice versa. Moreover, the state of the 
system, 2, need not be an eigenstate of either Aor B. 

Pll call the normalized eigenstates of A the Ws, and those of B the 
gs. It turns out that the set of all normalized eigenstates of a Hermitian 
operator, such as the set of all Ws, or all és, forms a complete set of states. I 
will develop the critical concept of a complete set of states in considerable 
detail in Chapter 4—the important point for now is that any state may 
be “expanded” in terms of such a set. That is, any state may be written 
as a linear combination—a weighted sum—of such states. Thus, Q may be 
expanded as a linear combination of either the Ws or the ds: 


N 
Q = NO alias 
m=1 


Q 


N 
5 QAkỌk, (2.9) 
k=1 


where the summations run over all N states that comprise the complete 
set, and the cms and qs are called expansion coefficients. In quantum 
mechanics, a state which is expanded as a linear combination of (more 
than one of) the eigenstates of some operator is usually referred to as a 
superposition state, or just a superposition. 

Let’s look at the case for the observable B. The eigenvalue equation is 


Bo; = bjĝj, (2.10) 


where the bjs are the eigenvalues. But how do we deal with a state that is 
not an eigenstate of B? For example, what do we do with BQ? We need 
only expand 2 as a superposition of the eigenstates of B to see that 


BQ = BY > andr = Blards + azda +--+ avon) 
k=1 


= b101¢1 + boa2d2 +++» +anon. (2.11) 
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Well, we can write this out formally, but what does it mean? Postulate 
2 tells us, through eigenvalue equations, about measurements, but only on 
eigenstates of the relevant operator, not on superpositions of eigenstates 
of that operator. So, while Eq. (2.11) is mathematically correct, it doesn’t 
predict the results of measurements of the observable corresponding to B 
on the state 2. 

Postulate 3 tells us how to handle such cases. The result of any single 
measurement of the observable corresponding to B on the system state Q 
will be just one of the eigenvalues of B, not a “superposition” of eigen- 
values. The probability of obtaining, say, the jth eigenvalue of Ê , denoted 
Prob(b;), is given by the complex square of the expansion coefficient of the 
jth eigenstate of B. That is, Prob(bj) = aja; = |a;|?. Postulate 3 thus 
provides the means to construct a probability distribution for measurement 
of any observable with the system in any quantum state.® Finally, Postu- 
late 3 tells us that, after a measurement yields a particular eigenvalue, the 
system is left in the corresponding eigenstate. 

Simple as it is, Postulate 3 is one of the more remarkable statements 
in physics. It says that even if the system state cannot be characterized 
by only one eigenvalue, we will nevertheless obtain only one eigenvalue 
upon measurement—this is the root of the infamous measurement prob- 
lem of quantum mechanics. And although Postulate 3 provides a rule for 
determining the probability of obtaining each eigenvalue, it provides no 
explanation as to the physical source of these probabilities. Finally, we 
could hope to argue that these probabilities simply reflect changes in our 
state of knowledge, not in the physical system. But the postulate forecloses 
such attempts, by stating explicitly that the physical state of the system 
is indeed altered in accord with the measurement result. 


2.3.2 Continuous Eigenvalues 


To simplify things, I’ve limited our discussion to operators with discrete 
eigenvalues. The continuous eigenvalue case (which we will discuss in detail 
in Chapter 12) tends to be less transparent, and involves mathematical sub- 
tleties not essential to understanding the overall quantum framework. In 
both quantum and classical physics, position may take on a continuum of 
values. This is reflected in that particular type of quantum state often called 
a wavefunction. Because introductory students often have some familiarity 
with wavefunctions, but not with the more general concept of a quan- 
tum state, I now establish the connection between wavefunctions and our 
postulates. 

Although it’s not obvious, the following statement essentially combines 
Postulates 2 and 3 and restates them in terms of wavefunctions. 


8 Postulate 3 must be modified for use with mized states, also called mixtures. The 
mixed state concept is, however, rather advanced; I will not discuss it. 
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Let U(x) be the normalized wavefunction (quantum state) of a 
single-particle system. Then the interpretation of U(x) is that 
the probability of finding the particle, upon measurement, to be 
in the infinitesimal interval between x and x + dz is |W(a)|?dz. 
Equivalently, the probability of finding the particle in the finite 
interval (a,b) upon measurement is 


b 
f |W (x)|?de. (2.12) 


The particle will be left in the eigenstate corresponding to the 
measured value of x immediately after the measurement. 


In Postulate 3, the |c;|?s were probabilities. But |W(x)|? is evidently 
a probability density (of dimensions probability/length), which must be 
integrated to obtain a finite probability. This is similar to, say, calculating 
the total mass in some region of space. If the mass in the region comprises 
discrete objects, we simply sum their individual masses. But if the mass is 
continuously distributed, we must integrate the mass density (of dimensions 
mass/length®). 

Normalization now accounts for the physical requirement that, upon 
measurement, the particle must be found somewhere. This implies that 
if the probability density is integrated over the entire x axis, the result 
must be 1: 


co 
J |Y (x)| dz = 1. (2.13) 
= O 

Establishing a connection with the postulates hinges on three key points 
(justified in Chapter 12). First, each unique position, such as zo, corre- 
sponds to a unique eigenstate of the position operator ĉ; the corresponding 
eigenvalue is just xo itself. Second, the expansion coefficient for the eigen- 
state corresponding to xo is V(x), that is, the wavefunction evaluated at 
Xo. Finally, for a wavefunction, such as V(x), the operator ĉ corresponding 
to the position observable is simply x itself (that is, ĉ = x in this case).° 

Now suppose a system is in the eigenstate corresponding to the eigen- 
value zo. Then the only non-zero coefficient, also, corresponds to xo; that 
is, U(x) = 0 for x 4 xo. Thus, if a position measurement is made, the result 
will with certainty be xo. This is in accord with Postulate 2, which states 
that if a system is in the jth eigenstate of A, then measurement of A must 
yield the eigenvalue aj. 

What if the system is not in a position eigenstate? The expansion coef- 
ficient for each eigenvalue (each x) is U(x) itself, while the probability 
density is |W(a)|?. This is in accord with Postulate 3, which states that if 


9 This holds only because wavefunctions are generally written in the position 
representation—I discuss representations in Chapter 4. 
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a system is in a superposition of the eigenstates of A, the probability of 

obtaining the jth eigenvalue, aj, is the square of the jth coefficient, |c;|?. 
In case you are mystified by the complex square, I will write out a 

simple, specific example for you. Assume the quantum state is given by 


U(x) = Aet”, (2.14) 
where A is a (possibly complex) constant and i = /—1. Then 
|W(ax)|? = U(x) U(x) = A*e** Aet"? = |Al?. (2.15) 


Upon measurement, only one x value will result, but for this simple state 
all xs are equally likely to be found. 

Equation (2.12) states that the probability that measurement will find 
the particle in the interval (a,b) is obtained by adding up (integrating) the 
probabilities that the particle will be found at each point within that range. 
The same idea holds in Postulate 3: the probability that, say, either a, or 
az will be found upon measurement of A is just the sum |c1|? + |cs|?. 

An important complication that arises in connection with continuous 
eigenvalues is the nature of the eigenstates, which are both mathemati- 
cally abstract and physically unrealizable. Because this topic requires some 
mathematical sophistication to handle properly, I will simply sketch it here; 
a more detailed discussion appears in Chapter 12. 

I have written eigenvalue equations of the form, 


D ok = Pek: (2.16) 


This is an example of an eigenvalue equation for the case of discrete eigen- 
states and eigenvalues. But what about cases in which the eigenvalues and 
eigenstates form a continuum? For example, upon measurement, a particle 
could be found to be anywhere along the x axis. Thus, there must exist a 
continuous set of position eigenvalues, the xs, and the position eigenvalue 
equation is, 


av = rt. (2.17) 


The lack of subscripts implies the continuous nature of the eigenvalues. The 
probability is given by the continuous probability density: |Y (x)|?. 

Accompanying the continuous eigenvalues is a continuous set of eigen- 
states.'° Such states are described by Dirac delta functions, denoted 5(z). 
The basic properties of d(x) are 


d(a-—a)=0 if r#a 


Jo, O(a —a)de=1 if xı <a < zə. (2.28) 


10 Note that we are not referring here to the continuity of the eigenstates themselves, 
but rather to the continuity of the set of eigenstates. That is, rather than a discrete set 
of eigenstates, such as V1, Vo,..., Un, the set of states is continuous, so that it doesn’t 
make sense to give the states individual labels such as 1,2,...,n. 


Closing Comments 15 


The above is to be interpreted as meaning that 6(«—a) vanishes everywhere 
except at the point where its argument vanishes, and that at that point 
the integrated value of (x — a) is 1.'! Since the eigenstates of position are 
Dirac delta functions, the position eigenvalue equation for, say, £o, may be 
written as 


£6(a% — zo) = «0(x — zo) ĉô(x — £o) = zoô(x — Xo). (2.19) 


The second equation is valid because (x — zo) = 0 except at xo. 

If the Dirac delta function seems tricky, that’s because it is. In fact, the 
Dirac delta is not a proper mathematical function, and only many years 
after it was first used by physicists did mathematicians deem such use 
legitimate. 


2.4 Closing Comments 


It should now be clear that in quantum mechanics only probabilistic predic- 
tions are generally possible (the exception being if the state is an eigenstate 
of the relevant operator). But consider, for example, a wavefunction such as 
W(x). The probabilities are obtained not from W(z) itself, but from |W(x)|?, 
so that it is through |Y (x)|? that quantum mechanics makes contact with 
the physical world. So why not just work with |W(2)|?, and call that the 
quantum state? What’s the big deal? 

The big deal is that if the fundamental thing is taken to be |W(z)|?, 
rather than U(x), you absolutely won’t have quantum mechanics any longer! 
One way to see this is that the time-dependent Schrödinger equation—a 
fundamental law of nature—is a differential equation for Y, not for |W|?. 
Moreover, a complex function such as VW must, in general, embody more 
information than its real progeny, |W|?. Remember, we square V only upon 
measurement. Absent a measurement, we must work with Y itself, not 
[T (2)}. 

Before closing, a few words about terminology are in order. In classical 
physics, the displacement of a wave (or, often, its maximum displace- 
ment) is called its amplitude. In analogy, quantum-mechanical wavefunc- 
tions—and, by extension, all quantum states—are often called probability 
amplitudes. 

The square of a classical wave’s amplitude is of great physical interest, 
because it determines the rate at which the wave transmits energy. Simi- 
larly, probabilities are of great physical interest in quantum mechanics, and 
they are obtained by (complex) squaring probability amplitudes, that is, 
quantum states. 


11 A concise summary of important properties of the Dirac delta may be found in 
Jackson (1999), pp. 26-27. 
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The term superposition, also, is rooted in classical wave theory, wherein 
it refers to a sum of individual wave amplitudes. By analogy, a superpo- 
sition state (whether for continuous or discrete eigenvalues) is a quantum 
state that’s expanded as a linear combination—a weighted sum—of the 
eigenstates of some operator. 

A final note about terminology: I’ve been careful about referring to “the 
operator corresponding to some observable,” and vice versa. Such termi- 
nology is correct, but cumbersome, so authors sometimes don’t explicitly 
distinguish between observables and their corresponding operators. For 
example, you may see a phrase such as “measuring an operator,” even 
though what’s really measured is the observable corresponding to an oper- 
ator. Please keep in mind that the terms operator and observable are not 
quite synonomous. 

Well, that’s it. Although I’ve certainly left some gaping holes in the fab- 
ric, and IIl keep trying to fine tune our understanding of the postulates as 
we proceed, I’ve now laid out the basic structure that forms the foundation 
for all of quantum mechanics. 

Let me end by briefly summarizing where we’ve been. The fundamental 
entity in quantum mechanics is the quantum state, a time-dependent func- 
tion that is obtained by solving the time-dependent Schrödinger equation 
(with the appropriate Hamiltonian operator). Observable quantities are 
represented by Hermitian operators, and the result of a measurement of 
some observable must be one of the eigenvalues of the corresponding Her- 
mitian operator. If the system is in an eigenstate of the operator when 
measured, the result will, with certainty, be the eigenvalue corresponding to 
that eigenstate. If the system is in a superposition with respect to the mea- 
surement operator, the probability of obtaining any particular eigenvalue 
is the complex square of the expansion coefficient for the corresponding 
eigenstate. 

Innocuous though it may seem, the preceding paragraph describes one 
of the great creations of the human intellect. Attending that creation are 
some of the most profound and puzzling questions ever to have confronted 
that intellect. 


2.5 Problems 


1. Suppose an observable quantity B corresponds to the operator B= 
ah 


Sm qz- For a particular system, the eigenstates of this operator are 


W(x) = Asin, where: n = 1,2,3,... 


Here A is a normalization constant. 

(i) Determine the eigenvalues of B for this case. 

(ii) Determine the probabilities, and the corresponding measured 
values, for B if the system is in the state (x) = 0.81 Vo(x)+0.6 V4(x). 
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2. The ket |x’) represents the position eigenfunction corresponding to 
the position z’. Write down the position-space representation of |x’). 

3. A wavefunction 7)(x) constitutes a set of (choose one): 

(i) expansion coefficients (probability amplitudes); 
(ii) probabilities; 
(iii) position operators. 

4. Consider some momentum-space wavefunction ¢(p,t) (analogous to 
the position-space wavefunction W(x,t) in Section 2.3.2). What is 
the probability that a measurement of momentum at time t would 
result in a value between po and po + Ap ? (Hint: just write down an 
integral.) 

5. You have a lab in which you can create a “particle in a box”. By this 
we mean a particle that is subject to the potential: 


U =œ, xz<0Qland r> L, 


U = 0 elsewhere. 


Note that any quantum state vanishes identically for U = oo. The 
Hamiltonian operator H , which appears in the Schrödinger equation, 
is also the operator corresponding to energy. The unnormalized energy 
eigenstates for this system are given by 


n(x, t) = sin (Z5) exp(—iEnt/ħ) n=1,2,3,...,00. 


Here En is the nth energy eigenvalue (you will not need the explicit 
form of the Ens for this problem). 

(i) Determine the proper normalization factor (a constant) for these 
states. 

(ii) Sketch the potential, U. Then sketch the modulus of 7, (see 
Section 10.1.2) and Prob(x) for y1. 

(iii) Repeat Part (ii) for Y2. 

(iv) How does Prob(x) depend on time for pı and %2? 

(v) You have a “black box” in your lab, a state preparation device, 
that does a pretty good job of preparing the initial state of the system 
in the form of a rectangularly-shaped packet of width L/2, centered 
at x = L/2. 

(vi) Calculate the first five expansion coefficients (i.e. for n = 
1, 2, 3, 4, 5) for this state. 

(vii) Provide a clear, qualitative argument as to why the expansion 
coefficients for n = 2 and n = 4 behave as they do. 

(viii) Denote the superposition of the n = 1, n = 2, and n = 3 states 
$3(a,t), and of the n = 1 through n = 5 states as ¢5(z,t). 

(a) Show explicitly that ¢3 and ¢5 are not properly normalized. 

(b) Properly normalize ¢3 and ¢s. 
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(c) Clearly argue why ¢3 and ¢5 were not automatically normalized. 
(This will probably require some careful thinking!) 

(ix) Use a plotting program to plot Prob(x) for ¢3(a,t = 0); then do 
the same for s(x, t = 0). 

(x) What do you expect to happen as the number of terms in the 
superposition increases (i.e. as we include more w,s)? Can you see 
evidence for this in comparing v1, ¢3, and 5? 

(xi) We have discussed discrete eigenvalues, and in Chapter 2 we 
also discussed continuous eigenvalues a bit. Suppose a physics-student 
friend asks you, “Are the eigenvalues of ¢3 and ¢5 discrete or contin- 
uous?” You begin your answer with the statement “It depends on 
which eigenvalues you’re referring to.” Finish your answer by clearly 
explaining this statement to your friend. 

6. To answer the following questions, set up and solve the eigenvalue 
equation for the operator corresponding to momentum along the x 
axis, that is: ihe. 

(i) What are the eigenstates (i.e. functions of x) and eigenvalues of 
this operator. Is the set of eigenfunctions and eigenvalues discrete or 
continuous? 

(ii) Write down a superposition of two of the momentum eigenstates. 
Be sure that your superposition is normalized, that is, that the total 
probability equals 1. This condition arises from the fact that if you 
make a measurement—in this case, of momentum—you must get 
something as a result. 

(iii) What is the probability distribution for momentum for your 
superposition? 


3 


What Is a Quantum 
State? 


The assumption... is that if a student learns to solve problems, the proper under- 
standing will come naturally as he does so. I have met a number of intelligent 
physicists for whom this simply is not so—having in the course of time lost hold 
of the computational devices, they have lost all, and no instinct remains to guide 
them.+ 

David Park 


What does the formalism of quantum mechanics mean? How is that 
formalism related to the physical world? What is the meaning of the quan- 
tum state? These are deep questions, and although they are addressed in 
both popular discussions and in the vast literature on the foundations of 
quantum mechanics, in many ways they remain unanswered. 

Much of the purpose of this chapter—and indeed of this book—is to pro- 
vide a framework for quantum mechanics that leads to a coherent, clear, and 
durable understanding. Somewhat surprisingly, this can be done; arguably, 
it is not the approach usually taken. In this chapter I first introduce some 
simple but essential concepts of probability and statistics. These are then 
used to develop a particularly lucid approach to quantum mechanics, the 
statistical interpretation, which we will adopt for the rest of our quantum 
journey. I close with a brief detour of the historical and philosophical milieu 
that was spawned by the sort of fundamental questions posed above. 


3.1 Probabilities, Averages, and Uncertainties 
3.1.1 Probabilities 


Probability is the fundamental stuff of quantum mechanics. Why? Because 
quantum-mechanical predictions must take the form of probabilities—we 
generally cannot predict the outcome of a quantum-mechanical experiment 
with certainty. The end-goal of many quantum-mechanical calculations 
is, therefore, to obtain probabilistic predictions for the results of mea- 
surements. To understand quantum mechanics, then, you'll need a little 
knowledge of probability and statistics—but only a little. 


1 Park (1964), p. vi. 
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Since the quantum world often behaves in strange and unexpected ways, 
so do quantum probabilities. But although the behavior of quantum prob- 
abilities may be puzzling, their statistical meaning is precisely the same as 
that of “garden-variety” probabilities, such as the probability of obtaining 
“heads” in a coin toss, or of purchasing a winning lottery ticket. Only a few 
basic concepts of probability and statistics will be needed to develop quan- 
tum mechanics. If you understand these concepts in simple, unmysterious 
applications, you understand them in quantum mechanics. The quantum 
world is subtle enough as it is—don’t make it more so by investing these 
concepts with undue mystery just because they appear in the context of 
quantum mechanics. 

Probabilities are the raw material out of which other statistical con- 
cepts, such as averages and uncertainties, are built. But what is a 
probability? We need not delve into foundational issues here;? but we do 
need a practical, working definition. 

Consider the following simple situation. A die has six faces. On each 
face are from one to six dots, each number of dots appearing only once. 
If I roll the die, what is the chance of obtaining a particular number of 
dots, say three? Because there appears to be no reason for the die to land 
preferentially on any face, I take each possibility as equally likely. I thus 
predict the chance of obtaining three dots on any roll to be t. 

What I’ve called the chance is really the probability; clearly, any proba- 
bility must lie in the interval [0,1]. Suppose G is some measurable quantity, 
Go is a particular value of G, and Prob(Go) is the probability of obtaining 
Go. If Prob(Go) = 0, then Go never occurs; if Prob(Go) = 1, Go occurs 
with certainty (always). 

How do we know that the predicted probability in our example, Z, is 
accurate? We test it experimentally, by repeatedly rolling the die. Say we 
roll it N times. If our prediction holds, we will find that as N becomes 
large, we will obtain three dots on approximately i of the rolls, and this 
approximation will generally become better as N increases. In fact, as N — 
co we expect that the number of times we obtain three dots will approach 
exactly N/6, in accord with the predicted probability. 

Clearly, the probability of obtaining any one of the six possible dot 
configurations is L If we add up all of these probabilities—that is, if we 
take account of all six possible dot configurations, and thus all six pos- 
sible results of a roll—we obtain a total probability of 1. This makes 
perfect sense: The probability that we will obtain some configuration on 
any roll—that something will happen—must be 1. This simple idea, which 
is rooted in probability (not in quantum mechanics per se), is the rationale 
for normalizing the quantum state. 

In this example we used our knowledge of the situation to predict 
the probability of obtaining three dots. We then tested our prediction 


2 Such issues are briefly discussed in Ballentine (1998), pp. 31-33. 
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experimentally. Instead we could have simply run our experiment first, 
and then used the experimental results to surmise that the probability of 
obtaining three dots is z. We can connect these two approaches formally. 

Suppose J is one possible outcome for each run of some experiment. I 
run the experiment N times, with N a very large number. If J is obtained 
N; times, then I might write Prob( J) = N;/N. Really, though, the exper- 
imental determination of a probability would require that N — oo (which 
is, of course, unobtainable in practice). Then I would have 


Jim Ny/N = Prob(J), (3.1) 


which connects my experimental results with the idealized concept of a 
probability. 

The collection of all probabilities associated with the various possible 
results of a measurement is the probability distribution. In the example of 
the rolled die, there are six distinct possibilities: the distribution is discrete. 
Suppose, however, that I also made repeated measurements of the time 
required for the die to come to rest after I released it. We would expect 
that these times could take on a continuous range of values. Suppose the 
predicted probability distribution for time, denoted P(t) (with dimensions 
probability /time), takes the form of a Gaussian 


P(t) ~ exp? to)” (3.2) 


(Note that to is the most probable time.) This is simply an assumption,? 
but a useful one: Gaussians possess nice mathematical properties, and they 
arise frequently in mathematics and physics. 

Figure (3.1) illustrates both the discrete distribution for the number 
of dots obtained in a die roll, and the continuous distribution P(t). For 
simplicity I will frame much of our discussion of quantum mechanics in 
terms of discrete distributions, but in Chapter 12 I will discuss continuous 
distributions in detail. 

Finally, you should be aware of something that can cause much unnec- 
essary consternation. Presumably, if we had sufficient information about 
any given roll of the die, and sufficient calculational ability, we could pre- 
dict with certainty the dot configuration resulting from a roll. Thus, our 
use of probability really just reflects our lack of knowledge, our ignorance. 

You may read elsewhere that things are much different in quantum 
mechanics, that quantum probabilities are not just reflections of our igno- 
rance, but are instead deep, mysterious, and fundamental, revealing the 
operation of chance at nature’s deepest levels—but is this true? This 
question is simply unanswerable at present (despite what anyone may 
claim). However, you need not worry about such questions to learn quantum 


3 This can’t be exact, because it allows for t < 0, which is physical nonsense. 
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Fig. 3.1 (a) Discrete probability distribution for number of dots obtained 
in a die roll; (b) continuous probability distribution for time required to stop 
rolling. Here the most probable time, to, is 2.5. 


mechanics. Regardless of any metaphysical meaning attached to quantum 
probabilites, the fact is that, in an operational sense, they are just prob- 
abilities—not unlike the probability of obtaining three dots when a die is 
rolled. 


3.1.2 Averages 


The notion of average, in the sense of “typical”, is familiar from everyday 
life. But we'll need a quantitative definition. Fortunately, this definition 
is fairly transparent, and (unlike some mathematical or scientific terms) 
closely related to its colloquial usage. 

Suppose I make N measurements of some quantity T, with resulting 


values T1, T2,..., TẸ. Then the average, or mean, of these measurements is 
simply: 
IA 
(T) = ain (3.3) 
j=l 


where I have used the common notation (T) to denote the average of T. 
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I could also calculate the theoretical average value of T—the value that 
I would expect to obtain if I made (very) many measurements. Suppose 
that a (single) measurement of T must yield one of a set of 6 possible 
values. If I know the probabilities for obtaining each of these 8 values, then 
the expected average value of T is 


B 
(T) = X Prob(Ty)Tp. (3.4) 
k=1 


It may not be obvious why Eq. (3.4) should hold, or how it’s connected to 
Eq. (3.3). Equations (3.3) and (3.4) differ because Eq. (3.4) is the ideal, 
expected average in terms of theoretical quantities (probabilities), while 
Eq. (3.3) is the average of a finite set of N actual measurements. Suppose 
the value Tj = a occurs Na times in our N measurements. Each time it 
occurs, a term of the form a/N appears in the sum in Eq. (3.3). The sum 
of all such terms is: (Na/N)a. In the limit N — oo, we have (Na/N)a > 
Prob(a)a (cf. Eq. (3.1)). If we do this for all T values, then Eq. (3.3) 
becomes Eq. (3.4). 

If T is a continuous variable, we have a probability distribution (or 
density), denoted p(T), with dimensions probability/T. The sum becomes 
an integral, the index k becomes superfluous, and Eq. (3.4) takes the form 


Tmax 
(T) = | o(TYT dT. (3.5) 


Tmin 


Here Tmin and Tmar are the minimum and maximum values that T may 
assume. 

In quantum mechanics, an ideal, expected average value—such as (T) 
in Eq. (3.4) or (3.5)—is usually referred to as an expectation value. Thus, 
the expectation value is a calculated (rather than measured) average. The 
term expectation value is slightly misleading because, despite its name, 
we do not, in general, actually expect to obtain the expectation value in 
any given measurement. For example, if the only two possible results of a 
measurement are +1 and —1, and if each result is equally probable, then 
the expectation value is zero. But we will never obtain this expectation 
value in any measurement. (Similarly, in any given roll of the die in Sec. 
3.1.1, we will never obtain the expectation value for the number of dots. 
Calculate it!) 

For discrete eigenvalues, the quantum expectation value corresponds to 
Eq. (3.4). Given, for example, the eigenvalue equation Bo; = b;¢;, and 
the state Q = 5°, a;¢; (from Sec. 2.3.1), the expectation value (B) is of 
the form 


(B) = 2 Prob(b;)b; = D3 la;|?bi. (3.6) 
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In the context of wavefunctions—which is how students often first 
encounter quantum-mechanical expectation values—we have 


b 
(a) = f W(x)* AU (x) dz, (3.7) 


where the observable is A (corresponding to the operator A), and the sys- 
tem is in the state U(x). Here the interval (a,b) is the region within which 
W(x) may be non-zero (and thus, within which the particle may be found 
upon measurement). Clearly, the limits of integration could be extended to 
(—o0, 00) without affecting the integral. 

Consider (x), the expectation value for position. In this case it’s easy to 
see that (x) takes the form of Eq. (3.5). From Section 2.3.2, we know that 
if ĉ acts on W(x), then ĉ = x. Thus, for (x), Eq. (3.7) may be written as 


(x) = [. U(x)*aU(x)de = L U(x)*U(x)adx = E |U (x)|?zdz. 
(3.8) 


We know from Section 2.3.2 that |U (x)|? is a probability density, as is p(T) 
in Eq. (3.5). So there’s a straightforward connection between Eq. (3.5)—a 
result from probability theory—and the quantum expectation value (2). 
The crucial step in Eq. (3.8) was taking V(x) = Y(x)r—a mathematically 
trivial operation, but only because ĉ = x. It was this step that led to the 
connection with Eq. (3.5), through the analogous roles of |W(a)|? and p(T). 

If we consider Eq. (3.7) for some A that—unlike ĉ in Eq. (3.8)—must 
be treated as an operator, rather than as a variable, it’s no longer obvious 
how to interpret U(«)*AW(x). The simple link to Eq. (3.5) is thus lost. 
Even in such cases, however, Eq. (3.7) still holds.* 


3.1.3 Uncertainties 


You'll hear much about uncertainty in quantum mechanics. But like proba- 
bilities and averages, the concept of uncertainty is fundamentally statistical, 
not quantum-mechanical.° 

The expectation value (i.e., the average value) provides important infor- 
mation about a probability distribution—but there’s much that it doesn’t 
tell us. Consider two possible probability distributions, denoted a and 
b, of some variable: Proba(+1) = Proba(—1) = 1/2, and Prob(+5) = 
Proby(—5) = 1/2. (See Fig. 3.2.) It’s easy to see that although the distri- 
butions share the same expectation value ((a) = (b) = 0), they are much 


4 See, for example, Bohm (1951), Sections 9.5-9.6; Schiff (1968), pp. 27-28. See also 
Section 4.4.3. 

5 My discussion of uncertainties here will be minimal; more detailed treatments are 
readily available. See, for example, Taylor (1997). 
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Fig. 3.2 Two non-identical probability distributions with identical expec- 
tation values ((a) = (b) = 0). 


different, as is evident from their different “spreads”. Clearly, we could eas- 
ily generate an unlimited number of widely varying distributions, all with 
expectation value zero. 

Uncertainty provides a means of characterizing the spread of a distri- 
bution. The most straightforward (and conventional) way to do this is by 
defining the uncertainty of a distribution to be its standard deviation, o. 
For a set of N measurements of a quantity T, the standard deviation is 


VL; - (T)? 
o(T) = i N ; (3.9) 


Evidently, o(T) is the root of the mean of the squares of the deviations 
from (T); the standard deviation is thus also called the root mean square 
deviation. 

To calculate uncertainties theoretically, we must cast the standard devi- 
ation into probabilistic language. Take T;, to represent the set of all M 
theoretically possible measurement results. Then ( (Th — (T))*) is the mean 
of the square of the deviations of the Tks from (T’), and Eq. (3.9) becomes 


o() = (Tk — T) = i Yuen (Te - ane) + M(T)? 


= f(T) — (TP. (3.10) 


If you use Eq. (3.10) to calculate the uncertainties for the two distributions 
in Fig. (3.2), you can see that ø does indeed characterize their spreads (try 
it!). But these distributions are obviously very simple—so what does the 
uncertainty tell us about the spread of an arbitrary distribution? 
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Use of the term “uncertainty” in an experimental context usually refers 
to an inability to make exact measurements. This inability manifests itself 
in the spreading out of a set of measurements around an average mea- 
sured value. Typically, the distribution of measurement results approaches 
the form of a Gaussian, or normal, distribution (as in Fig. 3.1(b)). You 
may have learned elsewhere that the standard deviation (the uncertainty) 
represents the 68% “confidence level” —that we expect 68% of our measure- 
ments will lie within one standard deviation of the average. This is correct, 
if the distribution is indeed Gaussian, as we expect for most experimen- 
tal uncertainties. But we really can’t say much about the meaning of the 
uncertainty for an arbitrary distribution except that, roughly speaking, a 
larger uncertainty means a more spread-out distribution. 

Because quantum states (and their corresponding probability distribu- 
tions) generally are not Gaussians, we can’t easily characterize the degree 
of spreading represented by the uncertainty in quantum mechanics. Never- 
theless, uncertainties remain useful in quantum mechanics, partly because 
it is often the relation between the uncertainties in different distribu- 
tions, rather than the uncertainty of any individual distribution, that is of 
interest. 

Probabilities, averages, and uncertainties—these are all the statisti- 
cal concepts needed to develop basic quantum mechanics, and I again 
emphasize that they are fundamentally statistical in nature, not quantum- 
mechanical. Having introduced these basic statistical concepts, I now 
introduce the statistical interpretation of quantum mechanics. 


3.2 The Statistical Interpretation 


Most texts adopt—implicitly or explicitly—the so-called Copenhagen inter- 
pretation of quantum mechanics. Like other, conflicting interpretations, 
Copenhagen is an attempt to tell us what’s really “going on”—to inform 
us of the quantum world beyond the formalism itself. 

By contrast, the statistical interpretation arguably is not an interpreta- 
tion, but a broad framework that describes how quantum mechanics works 
in actual practice. You may worry that by learning quantum mechanics 
from this perspective, you'll be at a disadvantage. On the contrary, the 
statistical interpretation provides an understanding one can have confi- 
dence in: because it’s “just” a framework, it remains compatible with other 
approaches, such as Copenhagen, while avoiding many conceptual puzzles 
that arise within them. 

But what is the statistical interpretation? In a sense, it simply amounts 
to the following edict: take seriously what quantum mechanics does tell us, 
and don’t take seriously what quantum mechanics doesn’t tell us. 

So, what does quantum mechanics tell us? First, it provides a pre- 
scription for calculating the quantum states associated with a physical 
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system, including how these states evolve in time. And, if we can calculate 
the quantum state that properly describes a particular system, quantum 
mechanics tells us how to obtain the probability distributions for the results 
of measurements performed on that system. 

That last paragraph could stand lots of unpacking. For now, however, 
I only wish to make a simple but crucial point. I will do so with a concep- 
tual example that is essentially devoid of mathematics. Consider a system 
with only two possible energies, FE, and EF. In general there is an infinite 
ensemble (set) of quantum states that could describe the system. However, 
for a particular physical situation, the system must be described by one 
particular quantum state,” which PI denote W. Suppose now that we have 
correctly calculated the quantum-mechanical state W, which then yields 
the following probability distribution for energy measurements on the sys- 
tem: Prob(£,) = 3, and Prob(E2) = 3. So far, we’ve simply followed the 
quantum prescription: 


e calculate a quantum state that correctly describes the system of 
interest, 
e use the quantum rules to obtain the probability distribution. 


But what does this probability distribution mean in a real, practical sense? 

Imagine we have an ideal energy measuring device in our laboratory. 
Quantum mechanics never predicts that we will obtain some intermediate 
value in any given energy measurement (see Section 2.3). Thus, the energy 
probability distribution becomes manifest not in a single measurement, but 
only in a distribution built up from an ensemble of measurements. Just as 
we never obtain a value between heads and tails in an individual coin toss 
(assuming that the coin never lands balanced on its edge), we never obtain 
a value between FE, and E> (such as, say, iE + 2 Fo) on any individual 
energy measurement: we always obtain either EF, or Ez. As the number 
of measurements, N, increases, we obtain FE, in approximately 3 of our 
measurements, and Es in approximately 2 of our measurements. And as 
N — œ, these approximations become better. All of this is precisely what 
we expect on the basis of our discussion of non-quantum probabilities in 
Section 3.1.1. 

In the laboratory, then, we generally must make many measurements 
before the probability distribution becomes manifest. And although our 
example dealt with energy, this holds regardless of what quantum observ- 
able one measures—energy, momentum, angular momentum, or position. 
Moreover, it holds regardless of interpretation. Copenhagenist or not, very 


6 This is not to say that the prescription is easily filled. One may face formidable 
challenges, both in terms of determining the correct Hamiltonian to use, and in terms 
of carrying out the prescribed calculations, that is, solving for the quantum state. 

7 If we considered mixtures, this statement would require modification (see 
Section 2.3.1). 
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few physicists would disagree with this “many measurements” view of 
experiments. 

Now it’s one thing to “know” something, and often quite another to 
intellectually digest it until it becomes a mode of thought. Perhaps what 
I’ve said seems simple enough: Treat the quantum probabilities like non- 
quantum probabilities, in that they become manifest through repeated 
measurements. But this point is so fundamental that it’s worth restating. 


The fundamental role of quantum mechanics is to provide a 
means to calculate possible measured values, and the probabil- 
ity distributions corresponding to those values, for measurements 
performed on physical systems. These probabilities become evi- 
dent only in an ensemble of measurements, never in a single 
measurement. 


And just as probabilities become manifest in an ensemble of measurements, 
so do averages and uncertainties. In the laboratory, these quantities arise 
through application of Eqs. (3.3) and (3.9) to an ensemble of measurements, 
never to a single measurement. 

Note carefully, however, that quantum mechanics tells us nothing about 
measurable quantities absent in a measurement. Take our example: quan- 
tum mechanics proper simply does not address questions such as, What is 
the system’s energy between measurements? Is the concept of energy unde- 
fined except when measured? Why, when measured, is the energy always 
Ey or E? 

Such questions are deeply interesting and important. For example, if 
we impose completeness (see Section 2.1), then the quantum state must 
somehow contain within it everything that can, in principle, be known 
about the system. Thus, the state itself must somehow provide answers to 
our questions—yet none are forthcoming. In the statistical interpretation, 
we choose to simply ignore such questions. And in the end they remain 
unresolved, for our instruction set—the quantum formalism—provides no 
answers. None are needed, however, to pursue our desideratum: developing 
a clear and coherent means to think about quantum mechanics. 

In the statistical interpretation, then, we generally refrain from looking 
beyond quantum mechanics’ operational role as a means of calculating 
possible measured values and the corresponding probability distributions. 
And these quantum probabilities become manifest in the same manner as 
non-quantum probabilities: in an ensemble of measurements of the quantity 
of interest. 


3.3 Bohr, Einstein, and Hidden Variables 
3.3.1 Background 


The interpretation of quantum mechanics has been a contentious issue in 
physics for decades. In older books one typically finds a matter-of-fact 
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acceptance of the Copenhagen interpretation. But in recent years interest 
in other interpretations of quantum mechanics has enjoyed a renaissance, 
and researchers in the field generally believe that the correct interpretation 
remains an open question. This has, in some measure, provided the impetus 
for my approach: regardless of any unresolved metaphysical questions, we 
can agree on the operational meaning of quantum mechanics as embodied 
in the statistical interpretation. 

I will not discuss the historical, philosophical, and technical aspects 
of the interpretation of quantum mechanics in any depth—to do so could 
easily fill a book in itself. I hope only to provide some background, and to 
dispel some persistent myths. 

The formalism of non-relativistic quantum mechanics was completed 
during 1925 and 1926. Also in 1926, Max Born showed how to obtain proba- 
bility distributions from the quantum formalism. This essentially completed 
what I have called the statistical interpretation of quantum mechanics. 
Operationally, this is sufficient, but it doesn’t tell us what’s really going 
on—what the quantum formalism, particularly the quantum state, means 
physically. And it’s the task of attaching physical meaning to the math- 
ematical formalism of quantum mechanics that has been the focus of so 
much debate. 

The 1927 Solvay Conference, in Brussels—a meeting of the great physics 
luminaries of the time—marks a turning point. There, Niels Bohr and 
Albert Einstein engaged in a famous, extended debate about the inter- 
pretation of quantum mechanics. Bohr espoused what would soon become 
the almost universally accepted view, Einstein vigorously calling that same 
view into question. 

The Bohr—Einstein debate is often portrayed as a decisive defeat for 
Einstein—an aging physicist who simply could not abandon his antiquated 
philosophical commitment to causality, and thus stubbornly opposed quan- 
tum mechanics until his death nearly thirty years later. But history, 
including history of science, is often not so simple as its folklore rendition, 
and this portrayal of the debate is open to serious questions. 

Nevertheless, the belief that Einstein had been defeated was widespread, 
and this contributed to the rapid rise to dominance of Bohr’s ideas, and 
what came to be known as the Copenhagen interpretation. After the 
Solvay Conference, very few physicists seriously questioned Copenhagen. 
Those who did were marginalized, largely being viewed as the old guard, 
incapable of effecting the revolution in thinking required by quantum 
mechanics. 

There are various versions of the Copenhagen interpretation.® But an 
insistence on completeness, which implies that the quantum state contains 


8 So, just what is the Copenhagen interpretation? This question, including historical 
background, is discussed from disparate perspectives in Cushing (1994), Chapter 3, and 
Howard (2004). 
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all information about the system it describes, is common to all of them. 
Even in 1927 completeness was gaining acceptance: 


We maintain that quantum mechanics is a complete theory; its 
basic physical and mathematical hypotheses are not further sus- 
ceptible of modifications.’ 

(Werner Heisenberg and Max Born) 


This would be a remarkably forceful assertion to make about even a mature 
scientific theory. Heisenberg and Born did so when quantum mechanics, as 
a coherent theory, was only some two years old. 

Completeness is clearly incompatible with the existence of any infor- 
mation other than that contained in quantum mechanics itself. Thus, an 
insistence on completeness rules out the existence of hidden variables that 
contain information beyond that embodied in the quantum state. The pre- 
cise nature of such variables is, for our discussion, unimportant. The point 
is that because hidden variables theories are inconsistent with complete- 
ness, the existence of hidden variables would invalidate the Copenhagen 
interpretation. 


3.3.2 Fundamental Issues 


Here I briefly discuss what are likely the three most important topics in 
foundations of quantum mechanics. The first two are problems, in the sense 
that no consensus has been reached as to how, or even whether, they will 
be resolved. They serve to illustrate that major interpretational problems 
remain in quantum mechanics. The last topic, nonlocality, is not so much 
a problem as a profound result. I introduce nonlocality here because of its 
fundamental importance, but also to warn against reading too much into 
this remarkable result. 

The Measurement Problem. The measurement problem, which arises 
from Postulate 3 (Section 2.3), is one of the most famous and stubborn in 
quantum mechanics. Often it is characterized as “the collapse of the wave 
function,” indicating that when a measurement is performed, the result will 
be one of the eigenvalues of the measured observable. But this description 
fails to convey the depth of the difficulty. That the wave function (quantum 
state) collapses when a measurement is performed is indeed surprising, but 
it is not obviously inconsistent. The real crux of the measurement problem 
is not simply that collapse occurs, but rather how it occurs. 

It can be shown (without undue difficulty) that collapse cannot occur 
due to “regular” time evolution by the Schrödinger equation (as in Pos- 
tulate 1). That is, we must suspend the Schrödinger equation during a 


9 From a lecture delivered at the 1927 Solvay Conference; quoted in Jammer (1974), 
p. 114. 
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measurement, so as to effect the collapse. But must not a measurement sim- 
ply be a particular kind of physical interaction—different in details, but not 
in kind, from non-measurement interactions? What, then, can justify “turn- 
ing off” the Schrödinger equation only during a measurement? As John Bell 
asked, “are we not obliged to admit that more or less ‘measurement-like’ 
processes are going on more or less all the time more or less everywhere? Is 
there ever then a moment when there is no [collapsing] and the Schrödinger 
equation applies?” !° This, more so than collapse per se, lies at the heart of 
the measurement problem. 

The Classical Limit. Special relativity is a more fundamental theory 
than Galilean relativity, so that Galilean relativity should emerge from 
special relativity in some appropriate limit. And indeed, this is just what 
happens: for velocities much smaller than light speed, special relativity 
reduces to Galilean relativity.!! The classical limit of quantum mechanics 
refers to the belief that quantum mechanics is a more fundamental theory 
than classical mechanics, so that classical mechanics should emerge from 
quantum mechanics in some appropriate limit. 

Ideally, this emergence would result from some simple limiting proce- 
dure, as for relativity. Alas, no generally applicable route to the classical 
limit has ever been found. Despite what some textbooks suggest, it is not 
clear how, or even if, the classical limit can be realized. It now appears 
that any resolution of the classical limit problem will require, at the least, 
both a limiting procedure and accounting for the interaction of quantum 
systems with the environment, through a sophisticated approach known as 
decoherence. 

Non-locality. If a theory is local, there are no superluminal interactions 
(i.e. interactions that travel at faster than light speed) between spatially 
separated systems.'? In 1964 John Bell published his renowned name- 
sake theorem, which showed that any hidden-variables theory that obeys 
locality must make (some) predictions that are inconsistent with those 
of quantum mechanics. Experimental tests confirmed quantum mechan- 
ics, effectively ruling out local hidden-variables theories. But even though 
local hidden-variables theories conflict with experiment, couldn’t non-local 
hidden-variables theories still be valid? 

To address this question, an argument like the following is some- 
times advanced. In the wake of experimental tests of Bell’s theorem, the 
choice is between standard (Copenhagen), local quantum mechanics, and 
non-local hidden-variables theories. But because non-locality violates spe- 
cial relativity (i.e. no signals may travel faster than light speed), which 
has been convincingly confirmed, non-local hidden-variables theories are 
unacceptable. Thus, all hidden-variables theories fail, local or non-local. 


10 Bell (1987), p. 117. 
11 Actually, this oft-repeated statement isn’t quite true; see Baierlein (2006). 
12 Some authors define locality somewhat differently. 
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This argument is problematic. The choice between a local and non- 
local theory is illusory—because standard quantum mechanics, also, is non- 
local. Says Tim Maudlin, “Stochastic [Copenhagen-type] theories involve an 
obvious element of superluminal causation. Deterministic [hidden-variables] 
theories were the only hope for locality from the beginning, a hope Bell 
extinguished [italics added].”!3 Or as James Cushing put it, “One of the 
central lessons we may draw from Bell’s theorem and from the analysis 
resulting from it is that such nonlocality appears to be a feature of our 
world, not just of this or that theory of physical phenomena. That being 
the case, nonlocality itself gives us little reason to choose Copenhagen over 
[hidden variables].” 14 Bell’s theorem is simply mute in the debate over the 
validity of non-local hidden-variables theories—the correct interpretation 
of quantum mechanics remains an open question. 

Even though it cannot resolve the hidden-variables debate, Bell’s the- 
orem remains of great interest—non-locality is a profoundly important 
discovery. “For those interested in the fundamental structure of the physical 
world,” writes Maudlin, “the experimental verification of violations of Bell’s 
inequality constitutes the most significant event of the last half-century.” 1° 


3.3.3 Einstein Revisited 


Because of his opposition to the Copenhagen interpretation, Einstein was 
widely regarded as out of touch with the new physics, unable to adapt to 
the bold ideas of quantum mechanics. You may hear that Einstein could 
never accept quantum mechanics, despite its overwhelming success, and 
despite the compelling arguments in its favor, such as those advanced by 
Bohr at the 1927 Solvay Conference. 

But Einstein did accept the predictive validity of quantum mechan- 
ics—that it provides a correct description of the world. In essence, he 
accepted the statistical interpretation. What he did not accept, what 
his objections were rooted in, was an insistence on quantum mechanics’ 
completeness. 

Regarding the fundamental role of probability in quantum mechanics, 
Einstein famously stated that “God does not play dice”—to which Bohr 
reportedly responded: “Stop telling God what to do.”!® And many still 
seem to regard Einstein’s remark as a self-inflicted wound: an indication of 
his stubborn unwillingness to change, his lamentably archaic attitude in the 
face of the new physics. But Einstein’s insistence that chance cannot be a 
fundamental principle of nature was no more dogmatic than the insistence 
of so many of his contemporaries that it must be. Perhaps a worthy retort 
to Bohr would have been: “Stop telling God he must play dice.” 
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Maudlin (1994), pp. 138-139. 
Cushing (1994), p. 47. 
Maudlin (1994), p. 4. 
Calaprice (2000), pp. 251-252. 
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Evidently Einstein did object to the notion that chance operates at the 
most fundamental level of nature. But a far deeper objection was not to 
chance, but to physical unreality, which, Einstein argued, arises from the 
insistence on completeness. 17 

The attitudes of Einstein, and those few like him, towards quantum 
mechanics are often seen as archaic and outmoded. But one could also adopt 
a sympathetic perspective. Facing a tide of scientific opinion which had 
turned decidedly against them, these brave souls persevered. And Einstein, 
in particular, seemed unperturbed by it all. Near the end of his life, he 
wrote that... 


Few people are capable of expressing with equanimity opinions 
which differ from the prejudices of their social environment. Most 
people are even incapable of forming such opinions. 


This indictment was leveled at the community at large, but might it also 
have been directed at the community of scientists? Einstein’s words form 
a fitting if unintentional tribute to their author—a man who never relin- 
quished his intellectual integrity or independence, despite the prejudices of 
his scientific environment; a man who grew estranged from the scientific 
community, but never from science.'® 


3.4 Problems 


1. Suppose the possible measured values of r for some quantum system 
are: 
k 
Tj = -z3 (17, 
where k is the index that appears in the cs, the expansion coeffi- 
cients. You have an excellent lab at your disposal, operated by highly 
competent experimental physicists. 
(i) As a preliminary test, the lab prepares systems with the following 
expansion coefficients: 


= 1. = yf 5: — 3i 
C1 = JE’ C2 75> © 15° 


Calculate (r), the expectation value of the quantity r, in the above 
state. 

(ii) Your experimentalist colleagues prepare 75 systems in the above 
state, and measure r on each of them. They obtain rı in 6 runs, r2 in 


17 See Ballentine (1972). A fascinating correspondence amongst Einstein, Max Born, 
and Wolfgang Pauli sheds light on this topic. See: Born (1971), pp. 205-228. Note that 
“statistical interpretation” is used therein in a much different sense than I have used it. 

18 Quoted in Einstein (1984), p. 102. 
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22 runs, and rs in 47 runs. Calculate the experimental average, and 
compare it to (r) above. 

(iii) In the actual experiments you wish to run, the situation is not 
so simple. Now there may be many expansion coefficients. Moreover, 
you cannot afford to make simple calculational errors. Write a program 
(using Fortran, MATLAB, etc.) to calculate (r) from a known set of 
coefficients (which you enter into the program). 

(iv) Your experimentalist colleagues prepare the systems with the 
following coefficients. 


Sa pS. — 2 er ae Sa ea 
C1 = y 50 C4 = \/ 50 C7 = iy 5g ClO= 59 A = —*/ 50 
1 / 6 . J3 | 8 
C2=—1/59 = 41/50 C8 = 50 ĉi ZUW 50o «(C14 = 50 
5D — FB _ /2 eae ay lof 
CIS 50 C6 =4/ 50 C9 = 50 C12 = 4/ 56 C15 = 50 


Now run your program with these coefficients. (You can use Part (i), 
above, to check it.) It’s easy to see the power (and accuracy, if you 
write your program correctly!) of using a computer to calculate such 
things, rather than calculating “by hand”. 

2. Consider a normal, six-faced die. Each face has from one to six dots 
on it (each number of dots appearing only once). On any die roll, each 
face has an equal chance of coming up. 

(i) Calculate < dots >, the expected (theoretical) average number 
of dots. 

(ii) Suppose you roll the six-faced die N times, and obtain the 
following results. 


No. of dots No. of times obtained 


One N/6 
Two N(3/24) 
Three N(3/16) 
Four N/6 
Five N(7/48) 

Sie N(5/24) 


Calculate < dots >, the average measured number of dots. Should 
this quantity be called an expectation value? How about < dots > 
from Part (i)? Should the quantities in the right-hand column above 
properly be called probabilities? 

(iii) Calculate the uncertainty for the measurement results of Part (ii). 
(iv) Calculate the theoretical uncertainty for the system of Part (ii). 
Does your result seem reasonable? Does it agree with the result of 
Part (iii)? Why, or why not? 
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(v) The Cosa Nostra casino in Las Vegas uses “loaded” dice. The 
probability distribution for such a die is: 


No. of dots Probability 


One 5/24 
Two 1/6 
Three 1/8 
Four 1/8 
Five 1/6 
Six 5/24 


Calculate < dots > for such a die. Also calculate the uncertainty. 
(vi) Plot the probability distribution for the die of Parts (i-iv). Show 
the uncertainty, also. Then do the same for the die of Part (v). 


A 


The Structure of Quantum 
States 


Before I had studied Zen for thirty years, I saw mountains as mountains and 
waters as waters. When I arrived at a more intimate knowledge, I came to the 
point where I saw that mountains are not mountains, and waters are not waters. 
But now that I have got its very substance I am at rest. For it’s just that I see 
mountains once again as mountains, and waters once again as waters. 


Ch’ing- Yuan! 


Quantum mechanics may at first seem deceptively simple. But as its 
detailed structure is unmasked, it becomes complex and mystifying. In 
time, an inner simplicity is revealed—one arising now not from ignorance, 
but from insight into quantum mechanics’ fundamental structure. 

Perhaps you now feel that you’ve acquired a good grasp of the concept of 
a quantum state. But grasping the essence of quantum mechanics requires 
a deep understanding of the state, and the simple picture presented so far 
is insufficient to develop such an understanding. 

This chapter first provides the requisite mathematical background, and 
then introduces Dirac notation—an immensely useful and elegant language 
with which to represent states. The power of Dirac notation begins to reveal 
itself as the scalar product—a pervasive concept in quantum mechanics—is 
developed. Finally, the key quantum concept of a representation is explored. 
Only when the relations amongst eigenstates, superpositions, and represen- 
tations are understood can the fundamental structure of quantum states 
be revealed. 


4.1 Mathematical Preliminaries 
4.1.1 Vector Spaces 


Our goal is the development of a working understanding of quantum 
mechanics, rather than a rigorous mathematical exposition. To bet- 
ter understand quantum states, however, we’ll need some fundamental 
mathematical concepts—these I present in this section and the next. 


1 Watts (1989), p. 126. 
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Although elementary vectors such as force and momentum are, at most, 
three-dimensional, the mathematical concept of a vector may be extended 
to any (integer) number of dimensions. The familiar scalar (dot) product 
of two Cartesian vectors is immediately generalized to n dimensions—the 
scalar product of two n-dimensional vectors A and B is 


= So AkBr = A1 By + A2B2 +--+ + AnBn. (4.1) 
k=1 


The geometric interpretation of A-B is that it returns a scalar, the size of 
which is the length of the projection of vector A onto vector B , multiplied 
by the length of vector B. (We could equally well think of A-Bas giving 
the length of the projection of B onto A, multiplied by the length of A.) 

If we insist that A and B be unit vectors, then the interpretation is 
simplified. Since the length of each vector is 1, A-Bis simply the projection 
of either one of the vectors onto the other. In the xy plane, for example, if 
A is the unit vector in the x-direction, and B a unit vector at an angle of 
1/3, then A- B = cos(r/3) = 1/2, which is the projection of A onto B. 

Imagine a Cartesian coordinate system in a three-dimensional space. 
Each of the three unit (length 1) vectors é1, é2, and é3 lies along one of 
the three Cartesian axes.” These unit vectors are orthonormal: the scalar 
(or dot) product of any one of the unit vectors with any other one is 0 
(orthogonality), and the scalar product of any one of the unit vectors with 
itself is 1 (normality, i.e., the vectors are of unit length). 

Clearly, any vector, say y, in this three-dimensional space may be 
written as a linear combination (a weighted sum) of ê, é2, and és: 


V = a6; + bêz + yé3 (4.2) 


where a, 3, and y are suitably chosen constants. We say that €1, é2, and 
êz span the space, or that they constitute a complete set of vectors, or a 
set of basis vectors, in the space. 

However, we could write V as a linear combination of any complete 
set of vectors in our space. For example, we could define a new Cartesian 
coordinate system by rotating our original system. Our unit vectors along 
the new Cartesian axes then point in different directions than ê, é2, and 
é3, but they still constitute a complete set (and are still orthornormal), so 
we can write y, or any other vector in the space, as a linear combination 
of these new basis vectors. 

In an n-dimensional space, any set of n orthogonal vectors forms a 
complete set. However, non-orthogonal vectors may also form complete 
sets. The necessary condition such that a set of n vectors forms a complete 


2 Note that here the “hat” notation denotes a unit vector, not an operator. 
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set—such that any vector in the n-dimensional space may be written as 
a linear combination of the vectors in the set—is that they be linearly 
independent. Linear independence means, in essence, that none of the n 
vectors in the set can be written as a linear combination of the other n — 1 
vectors in the set. 

As a simple example, consider the xy plane (n = 2). The Cartesian 
unit vectors and g% are orthonormal (their scalar product fulfills the 
orthonormality conditions set forth above), and thus linearly independent: 
we cannot write ĉ as a linear combination of the (only other) vector ĝ (and 
vice versa). That is, there is no constant a such that ĉ = ag. Accordingly 
(and obviously), any vector in the zy plane may be written as a linear 
combination of the set ĉ and 4%. 

Now, however, consider the set of unit vectors: #, (¢+9)/V2. It is impos- 
sible to write either vector as a linear combination of the other, so these 
two vectors are linearly independent. They are not, however, orthonormal 
(or even orthogonal). A bit of thought should convince you, though, that 
any vector in the xy plane may be written as a linear combination of ĉ and 
(ê + 9)/Vv2. 

Orthogonality implies linear independence, but linear independence 
does not imply orthogonality. Orthonormality of a set of n vectors in an 
n-dimensional space is thus a sufficient, but not necessary, condition for 
the vectors to span the space. Nevertheless, it is usually convenient to use 
orthonormal basis vectors, just as we would usually choose ĉ and ĝ, not ĉ 
and (ĉ + ĝ)/V2, as a basis set in the ry plane. 

So if Ñ is a vector in an n-dimensional space (where n could be infinite), 
and {é;} is a set of n linearly independent basis vectors in the space, then 
N may be written as the linear combination, 


N = 5 C0; = C10} + C2b2 ia E Cr üna (4.3) 


7 


where the cs are called expansion coefficients. But how do we find the 
cis? For the general case, where our only constraint on the basis vectors 
is that they are linearly independent, finding the c;s may take some effort. 
If, however, the basis vectors are orthonormal, finding the c;s becomes 
straightforward. 

If @ is some vector in the xy plane, and our orthonormal basis set is ĉ 
and ĝ, then @ can be written as 


= aî + cof = apt + ay = (E - ê) + (E-G). (4.4) 


That is, cı and c2 are simply a, and ay, the (magnitudes of the) projections 
of @ onto the x and y axes, respectively (Fig. 4.1); az and a, are, in turn, 
just the scalar products of @ with the unit vectors ĉ and ĝ, respectively 
(cf. Eq. (4.1)). This result immediately generalizes to n dimensions. For 
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Fig. 4.1 Projection of the vector @ onto the ĉ and ĝ basis vectors. 


example, if the set {6;} is not just linearly independent, but orthonormal, 
N may be written as 


i 


4.1.2 Function Spaces 


Now I introduce another, quite abstract, sort of mathematical “space”: 
function space. We may visualize vectors in space—even in an abstract, 
n-dimensional space. But this visualizability evaporates in function space, 
and that can lead to confusion. The problem partly stems from the fact 
that, in common usage, the word “space” suggests something that should 
be visualizable. In mathematics, however, the word space refers not to real 
space, or even to some analog thereof, but to a set of rules that govern the 
behavior of specified mathematical objects. Thus, to say that an object is 
in function space simply means that it behaves according to certain rules. 

The bad news, then, is that function spaces are much more abstract 
than the “regular” vector spaces we’ve encountered so far. The good news 
is that a close analogy exists between vectors in a vector space, and mathe- 
matical functions in a function space. By importing the concepts of vector 
spaces into function space, the abstract nature of the latter becomes more 
manageable. 

Suppose two vectors in an n-dimensional vector space, C and D, are 
orthonormal. The orthonormality condition may be expressed as 


bi siete He 0, if ČD 
EB = SGD, = C.D. + CDs t+ GDn =f ay, 
i=l 
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We know that any set of n such orthonormal vectors forms a complete set, 
and any vector in the space may be written as a linear combination of the 
basis vectors that comprise such a set. 

Function spaces are defined in analogy to vector spaces. Just as a class 
of vectors forms a vector space, so does a class of functions form a function 
space. For example, the set of second-order polynomials on the interval 
(0,1) forms a function space. Another example is the set of real functions 
of x that are square-integrable on some interval (£1, £2). 

For complex functions g;(x) and g(x), the orthonormality condition is 


oP if j Ak 
[ 9;(2)" 9. (2) dx = 03, = { : if f k l (4.7) 
where ĝjk, the Kronecker delta, is defined as indicated by the right-hand 
side of Eq. (4.7). Equations (4.6) and (4.7) are clearly analogs of each other. 
For vectors, we take the products of the corresponding components of each 
vector, and sum over all such products. For functions, we take the product 
of the two functions’ values at each x, and integrate over «.4 

In analogy with vectors, any function in some space can be written, or 
expanded, as a linear combination of any complete set of basis functions 
for that space. In an n-dimensional function space, any set of n linearly 
independent functions forms a complete set. And, as for vectors, orthonor- 
mality is a stronger condition than linear independence, so any set of n 
orthonormal functions constitutes a complete set. 

Explicitly, if {g,(x)} constitutes our complete set of basis functions, 
then, 


f(a) =>" angola), (4.8) 


where the ans are expansion coefficients. If the g,(x)s are orthonormal, the 
ans take a form analogous to that for the expansion coefficients for vectors 
in an orthonormal set: 


Am = i Gm (a) f (a) da. (4.9) 


1 


In Eq. (4.5), the expansion coefficients take the form NÑ - 6;, that is, the 
projection of the arbitrary vector N onto each of the basis vectors. In 
Eq. (4.9), the expansion coefficients take the form of the projection of the 
arbitrary function f(x) onto each of the basis functions g(x). 


3 A real function f(a) is called square-integrable if it satisfies tee { f(a) }?dx < 00. 

4 For complex functions, we also conjugate one of them, as in Eq. (4.7). Had we 
considered vectors with complex components, an analogous conjugation would appear 
in Eq. (4.6). 
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We thus have a fundamentally important result: just as any vector can 
be expanded as a linear combination of any complete set of vectors in the 
vector space, so can any function be expanded as a linear combination of 
any complete set of functions in the function space. We also know, for both 
vectors and functions, the forms of the expansion coefficients.® Applied 
to quantum states, these results become some of the most important and 
pervasive tools in quantum mechanics. 

Like vectors and functions, some quantum states are described with a 
discrete set of expansion coefficients, others with a continuous set. In either 
case, though, they must obey certain constraints on their mathematical 
behavior—this restricts them to a particular type of space: Hilbert space. 
(The state spaces in quantum mechanics may be of varying dimensions 
and contain within them the eigenstates of various operators, but they all 
must be Hilbert spaces.) Any state in some Hilbert space may be expanded 
as a linear combination of any complete orthonormal set of states in that 
Hilbert space. 


4.2 Dirac’s Bra-ket Notation 
4.2.1 Bras and Kets 


The great 20th century physicist Paul Dirac introduced into quantum 
mechanics a notation that is exceptionally useful, and extensively used; 
it sometimes seems that Dirac’s creation has a life and power of its own. 
We will build this notational system gradually, and point out pitfalls along 
the way. Please realize from the outset that Dirac notation is just nota- 
tion. Nevertheless, it often appears mysterious and confusing at first. Once 
you see its simplicity, though, Dirac notation becomes an elegant and 
powerful tool. 

The starting point for Dirac notation is simple: a state, say Y, is written 
as a ket, |Y). Corresponding to a ket such as |W) is a bra, denoted (Y|. We 
will have much more to say about the relation between bras and kets later. 
For now, we simply point out that if a bra on the left meets up with a 
ket on the right, for example, (U|W), the result is a bra-ket, or “bracket”. 
Since such meetings are so common, Dirac notation is often called bra-ket 
notation. 

As a simple but important first example of bra-ket notation, consider 
the expansion of a state Q in a set of states {¢,}: 


2 =D ade. (4.10) 
k 


5 For more rigorous treatments see, for example: Jackson (1999), Chapter 2; Arfken 
(1970), Chapter 9; Kovach (1984), Sections 2.3-2.5. 
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In bra-ket notation this becomes 


|) = So arék). (4.11) 
k 


The defining relation between a ket state and the corresponding bra 
state, such as between |Q) and (Q|, is 


I) = E ol) — O = F 95 (45. (4.12) 


Equation (4.12) indicates that if we write out (Q| and |Q} as superpositions, 
the correspondence between them is maintained by complex conjugating 
the expansion coefficients of (Q| and |Q) relative to each other. 

Note that Eq. (4.12) defines a correspondence—not an equality. It’s 
tempting to think that (Q| should be expressible in terms of |Q)—that an 
equation should exist relating |Q} and (Q|. But that would be to miscon- 
strue the role of the bra state, which is (as we shall soon see) to facilitate 
a mathematical operation: taking the scalar product. 


4.2.2 Labeling States 


There is no new notational form for operators or numbers (such as eigen- 
values) in bra-ket notation. Thus, for example, an eigenvalue equation such 
as AV; = a, VU; becomes 


A|W,) = ap|Ve). (4.13) 


We may, however, label bras and kets by any convenient means, so we could 
have chosen different state labels. A bra or ket is always a quantum state, 
so it seems a bit redundant to write Y in the kets. Instead, we could use 
the eigenvalues, or just the index k, to uniquely label our states: 


Alax) = aplak), or (4.14) 
Alk) = aglk). (4.15) 


Equations (4.13), (4.14), and (4.15) are all equivalent. But they may 
not all be equally transparent, at least until you become comfortable with 
bra-ket notation. In particular, an eigenvalue equation such as Eq. (4.14) 
could easily be confusing. On the left side of Eq. (4.14) a, appears inside 
of a ket, a state. Then the same ket appears on the right side, mysteriously 
multiplied by a factor ag, now not written inside of a ket. Just what is this 
ak, anyway? 

Such an equation is easily understood if one recognizes how the various 
objects are to be interpreted. In bra-ket notation, if something is written as 
a bra or ket, such as |a} or (apl, it always represents a quantum state. With 
but a few (usually obvious) exceptions, if some object in Dirac notation is 
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not written as a bra or a ket, it is not a state, but some mathematical 
object such as an operator or eigenvalue.® In Eq. (4.14), Jap) is the k’th 
eigenstate of the operator Â, while a; is the eigenvalue corresponding to 
the state |ap). 

Because the mathematical objects in Dirac notation may be unfamiliar, 
the following simple rules may help you to make sense of things. 


e A bra-ket is a scalar product—therefore, it is (as we will see), a number. 

e An operator does not act on numbers. 

e An operator does act on states, that is, bras or kets—the result is a 
new state. 


Dirac notation is not well-suited to dealing with specific functional 
forms of wavefunctions. For example, if UV; = koexp(ia;x), with x a 
constant, and A = d? /dx?, then 


A ‘ d2 
AW; = A{ko exp(ia;x)} = Ta“ expliajz) = -K003 expliajx). (4.16) 


But because the label inside of a ket is just a label, we certainly would not 
write: 
X d 3 
A|W;) = Alko exp(ta;x)) = ga“ exp(iaj;x)) = —Koaj| exp(iajx)}. 
(4.17) 


Often, though, our calculations do not involve specific functional forms, 
relying instead on the fundamental structure of quantum states—for 
example, the fact that they are normalized, and that basis states are 
orthonormal. In such cases, bra-ket notation can be a powerful tool. 


4.3 The Scalar Product 
4.3.1 Quantum Scalar Products 


The scalar product—also called the inner product—of quantum states is 
analogous to the scalar product of ordinary vectors.’ The projection of a 
vector in some direction clearly is of great utility in classical physics. We 
may, for example, need to find the component of a force or field along some 
Cartesian axis. What work might we ask a scalar product to perform in 
quantum mechanics? 


6 Important exceptions include state operators (which we will not use), and expres- 
sions such as (2|W) = W(x). Also, in matrix mechanics, where states are normally 
represented as row or column vectors (cf. Chapter 6), Dirac notation may also be used. 

T For both vectors and quantum states, the terms scalar product, inner product, and 
dot product are synonomous, although the last term is not ordinarily used in quantum 
mechanics. 
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The postulates define the fundamental operations we wish to carry out 
in solving quantum-mechanical problems. Postulate 1 instructs us to find 
solutions of the time-dependent Schrödinger equation. Postulate 2 tells us 
that we must obtain the eigenvalues and eigenstates for the operator of 
interest. Then, by Postulate 3, we expand the system state in a superposi- 
tion of these eigenstates. The measurement probabilities are then given by 
the complex squares of the expansion coefficients. 

In a superposition, such as |Q) = 5°, a%.|¢x), the role of the expansion 
coefficients, the a,;s, is clear: they are “how much” of each |¢,;) there is 
in |Q). Bra-ket notation does not tell us how to find the ajs, but it does 
provide a means to write them in terms of a set of basis states without 
committing to the specific form of those states. 

To see how this happens, let’s look at the simplest possible case. Assume 
that there are only two basis states, |¢1) and |¢2). Now, when we take the 
scalar product of two ordinary vectors, as in Eq. (4.1), both vectors must 
be expressed in the same coordinate system. Similarly, when we take an 
inner product in quantum mechanics we work in a common basis for both 
states. Suppose, then, that we “expand” the basis states |1) and |@2) in 
their own ¢ basis. Clearly the expansion coefficients will simply be a, = 1, 
az = 0 for |¢,) and a; = 0, az = 1 for |d2) (analogous to writing, say, the 
unit vector ĉ, or ĝ, in the ĉ, ĝ, 2 basis). 

In Dirac notation, the scalar product of |Q) and |¢1) is the bra-ket 
(¢1|Q).8 When a bra-ket such as (¢1|Q) appears, summation is implicit, so 


(b1|Q) = (1(¢11 F 0 (2) (aile) DE alz) ) 


= 1ai(¢1|¢1) + 1 a2(¢1|¢2) + 0 a1 (62/61) + 0 a2(¢2|¢2) = a1. 
(4.18) 


Here I have used the orthonormality of the |¢;)s, that is: (¢1|¢1) = 
(2l¢2) = 1, and (¢1\¢2) = (¢2/¢1) = 0. The scalar product (|) is 
analogous to the scalar product of an arbitrary unit vector, say A, with a 
unit basis vector, such as ĉ. For Â. ĉ, the result is just Az, the projection 
of A onto ĉ. For (;|Q), the result is a1, the projection of |92) onto |¢1). 
Similarly, we could obtain (¢2|Q) = a2. 

Now consider a second superposition, |y} = gi|¢1) + g2|¢2). From 
Eq. (4.12), the scalar product of our two superposition states, (y|Q), 
evidently is 


(IN) = (gi (11 + 95 (621) (orld) + aalg2)) = ghar +9502. (419) 


This scalar product of normalized states is analogous to the scalar product 
of two unit (but not necessarily basis) vectors. 


8 The notation (¢1, 2), which I will avoid, is also used for the scalar product. 
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Why should the form of the scalar product that I’ve employed provide 
the projection of one state onto another? Specifically, why are the bra 
coefficients complex conjugated? (That is, why define the bra as in Eq. 
(4.12)?) Consider the scalar product of a normalized state, such as |), 
with itself. Evidently, if this is to constitute the projection of |Q) onto 
itself, we must have (Q|Q) = 1. This is certainly satisfied for the form 
of scalar product we have proposed since, by the normalization condition, 
|a1|? + |ag|? = 1. If, however, we do not complex conjugate when taking 
the scalar product, the result will not, in general, even be a real number. 

For states with continuous eigenvalues, such as two wavefunctions ¢)(x) 
and p(x), the scalar product becomes 


b 
(op) = i. o*(@w(a)ae, (4.20) 


where the limits of the integral correspond to the domain of x. We might 
have expected that the integral in Eq. (4.20) would appear in bra-ket nota- 
tion as Si (ple) de. This, however, is incorrect. In analogy with the discrete 
eigenvalue case, when a scalar product appears, such as (p\w), integra- 
tion is implicit in the notation. Thus, (p|q) corresponds to the integral 


of o” (x)(x). 


4.3.2 Discussion 


From Section 4.1.2, we know that if some function f(x) is expanded as a 
linear combination of the orthonormal basis functions g;(2), 


f(x) = X akgr(2), (4.21) 
k 


then aj, the jth expansion coefficient, is given by 


b 
a= | gj (x) f(x)dx. (4.22) 


But this is just (g;|f), the scalar product of f(x) and g,(x). Thus, if |p) 
is a basis function, then (p|7)) in Eq. (4.20) is really just the expansion 
coefficient discussed in Section 4.1.2. For the continuous eigenvalue case, 
then, as for the discrete case, we interpret the scalar product of states in 
analogy with the scalar product of unit vectors; that is, as the projection 
of one state onto the other. 

So if we expand a state as |€) = `, bx|Gx), then b; = (G,|€), and 
this holds regardless of whether ((;,|€) involves a sum, as in Eq. (4.19), or 
integration, as in Eq. (4.20). Thus, we may write 


IE) = X. del Be) = X (Fel) lr). (4.23) 
k 


k 
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A useful, general, and easily proven relation is 


(ulw) = (wla). (4.24) 


Thus, the probabilities in Eq. (4.23) may be written as |b;|?, but also as 
Krle)? = (Br lE) (El Bx). 

From Eqs. (4.19) and (4.20), we see that the scalar product of states, 
a bra-ket, always results in a number, just as does the scalar product of 
vectors.” When you see scalar products in equations, such as in Eq. (4.23), 
then, keep in mind that they’re just numbers, and you can move them 
around in your calculations like other numbers. Please realize, though, that 
this applies only to the entire scalar product; for example, (G,|€) is a num- 
ber and may be treated as such. But we certainly can’t separate this into 
(G,| and |£) and treat these objects as numbers; they’re states. 

In Dirac notation, an expectation value (cf. Chapter 3) looks much like a 
scalar product, but with an operator sandwiched between the states: (A) = 
(| A|W). Because expectation values are averages of measurable quantities, 
they must be numbers. But why, mathematically, should (W|A|W) be a 
number? 

In Section 4.2.2 I remarked that operators have the effect of changing a 
quantum state into another quantum state. In light of that fact, we can see 
that an expectation value is indeed a scalar product. For example, Â|Y) 
yields a new state: call it |Y). Then we may write 


(A) = (W/A]Y) = (x1 (ål®)) = (IY), (4.25) 


which clearly is just a scalar product—a number. So an expression such 
as (U|A|W) is a scalar product, but it’s not the scalar product of the two 
states that appear explicitly. Rather, it’s the scalar product of one of those 
states with the new state created by the operator acting on the other of the 
original states. 

Expectation values occur frequently in quantum mechanics, so it’s worth 
writing one out explicitly, in Dirac notation. Suppose G| gi) = 9;\9;), and 
|W) = Yp celga); then 


(G) = (UIG|Y) = J chGml (G cnlgn)) = X Cin Gl (Gnenlgn)) 


m,n 


= 5 CCngnOmn = 5 [cn] gn. (4.26) 


9 If, however, the integrand in a scalar product depends on variables other than the 
variable of integration, the resulting scalar product will also depend on those other 
variables. 


Representations 47 


Note that Eqs. (4.26) and (3.6) agree: the expectation value is the sum 
over all possible measurement results (eigenvalues), each weighted by its 
probability. 

Also in Section 4.2.2, I pointed out that Dirac notation is not well-suited 
to dealing with specific functional forms of quantum states. Although this 
may seem restrictive, it often affords great freedom. Because we are not 
concerned with specific forms, we really don’t care whether a scalar product 
such as (¢|W) involves a summation or an integration, or about the limits of 
such an integration. Such information is necessary in calculations involving 
functional forms for states, but not in the sort of abstract calculations done 
with bras and kets. In Dirac notation, (¢|W) symbolizes the scalar product, 
regardless of what means would be necessary to actually calculate it. 


4.4 Representations 
4.4.1 Basics 


Without a clear understanding of how eigenstates, superpositions, and rep- 
resentations are related, one cannot truly grasp the structure of quantum 
states. In Section 2.3.1 we discussed a state |Q) that could be expanded in 
the eigenstates of an operator A, the |W)s, or the (different) eigenstates of 
Ê, the |¢)s: 


N N 

I9) zm 5 Cm|Vm) = 5 (Pml) Em), (4.27) 
N N 

9) = X arlok) = S-(Ge12)|bx)- (4.28) 
k=1 k=1 


These equations provide two different ways to write |Q), that is, two differ- 
ent representations of |Q}. Clearly, the key elements of writing a state in a 
particular representation are (1) finding the relevant set of basis states (in 
this case, either the |)s or the |¢)s), and (2) finding the projection of the 
system state onto each basis state, that is, finding the appropriate scalar 
products. 

Also in Section 2.3.1 we saw that, to determine the probability dis- 
tribution for measurements of B in the state |Q), we expand |Q) in the 
eigenstates of B, the |)s. The motivation for so doing was that the prob- 
abilities are then easily obtained, and the action of B is simple because 
it acts on its eigenstates. Of course, this same procedure would serve to 
determine the probability distribution of B for any state. 

Whenever we write a state as a particular superposition, such as in Eqs. 
(4.27) and (4.28) (or, for that matter, as an eigenstate), we are choosing 
a representation. Our choice is typically made for reasons similar to those 
employed above: to enable us to deal with the action of some operator on the 
state. In Chapter 5 we will see important quantum-mechanical operators 


48 The Structure of Quantum States 


that do not represent observable quantities. For these, also, we typically 
choose a representation in which the action of the operator is simple. 


4.4.2 Superpositions and Representations 


Suppose that, rather than |Q), our system is in the state |¢;), the j’th 
eigenstate of B. The “expansion” of |¢;) in the ¢ representation consists, 
obviously, of only one term: |@;) itself. Of course, we could expand |¢;) in 
the eigenstates of A: ld;) = 24 FolVa). A question then arises: is |¢;) a 
superposition, or isn’t it? 

The answer is: it’s both. When expressed in the ¢ representation, |¢;) 
is a single basis state. But in the Y representation, |¢;) is a superposition. 
Referring to |¢;) as a superposition—or not—has meaning only if it’s clear 
what representation we’re dealing with. 

This is a fundamentally important and often misunderstood point, so 
Pll elaborate a bit. If a state is expanded in the eigenstates of some oper- 
ator, and if that “expansion” consists of only one term, then the state is 
not a superposition in that representation. But if there is more than one 
term in the expansion, the state is a superposition in that representation.'° 
Although in physics we may refer to some quantum state as “a superposi- 
tion,” please realize that this designation generally makes sense only with 
respect to some specific representation, that is, with respect to expansion 
over some specific set of basis states. 

A simple example makes things more concrete. Consider an infinite 
potential well: a particle of mass m in a (one-dimensional) region where 
the potential energy is zero for 0 < x < L, and infinite elsewhere (see 
Fig. 4.2). Classically, this corresponds to a particle subject to zero force for 
0 < x < L, and infinite force at x = 0 and x = L—thus, this system is 
often called a “particle-in-a-box.” 


x=0 x=L 


Fig. 4.2 The infinite potential well, or “particle-in-a-box.” 


10 Often, a state is a superposition in every representation, that is, no representation 
exists in which the state may be written as a single basis state. 
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In Section 9.2, the eigenstates and eigenvalues of the Hamiltonian 
operator H for the particle-in-a-box are found to be 


Pnl) = V/2/L sin(nra/L), (4.29) 
nen h? 

E, = InI (4.30) 
So, the particle-in-a-box problem is solved, right? Yes—and No! Although 
we have obtained the energy eigenstates!! (and eigenvalues), these are only 
a set of basis states. From this set, an infinite set of other, legitimate 
particle-in-a-box states could be formed, by superposition. And, unlike the 
energy eigenstates, such superpositions will be time dependent (as we will 

see in Chapter 11). 

Fully understanding the structure of the quantum states for even this 
simple system involves considerable subtlety—but it also leads to consid- 
erable insight. Let’s begin with an eigenstate of Ñ, say w2.!2 Expanded in 
the H representation, we simply have wz» itself. The expansion coefficient 
is 1, as is the probability: we will, with certainty, obtain Ey = 277h?/mL? 
upon measurement of the energy. Of course, similar comments apply to any 
one of the pns. 

Recall that each possible position, each x, corresponds to an eigenstate 
of ĉ. We see that in Eq. (4.29) the w,,s are already written in position rep- 
resentation, that is, as functions of x. Moreover, the Yns are superpositions 
in the position representation—else they each would consist of only a single 
x value. Note that the eigenstates and eigenvalues for Å are discrete for 
this system, while those for ĉ are continuous—an entirely legitimate and 
very common situation. 

Now consider a superposition of the w,(x)s, such as 


a(x) = -f vala) + valo)} = =f sin(2nx/L) +sin(4ne/L)}. (4.31) 


1 

v2 v2 
A state’s representation is determined by the basis, the eigenstates, over 
which it’s expanded. That’s a rather odd statement in this case: though 
expanded in the H representation, the eigenstate constituents are written 
as superpositions in the x representation. So, which representation is ®(x) 
written in? 

In Eq. (4.31) there is just one coefficient for each eigenstate of H. There 
are, however, two coefficients for each eigenstate of ĉ: at each value of 
x there is one contribution from w2(x), and another from %4(x). Thus, 
(x) is written in the H representation. Such cases are common: to time 


11 The states in Eq. (4.29) are solutions to the time independent Schrödinger equation. 
These are sufficient for present purposes, and are what is found in most texts. 

12 For now, I avoid writing wavefunctions, such as the ~j(x)s, in Dirac notation, as 
subtle notational issues arise when doing so. 
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evolve quantum systems we often expand position-space states in the H 
representation (see Chapter 11). 

Clearly, #(x) is an eigenstate of neither ĉ nor H, that is, ®(x) is not a 
single basis state in either the position (x) or energy (H) representations. 
In fact, there is no representation of particle-in-a-box states in which ®() 
is a single basis state. If this seems surprising, it shouldn’t. 

As we'll see in Chapter 5, corresponding to any Hermitian operator 
(and therefore, by Postulate 2, to any observable) is a complete set of 
eigenstates—the basis states of that representation. Although we can build 
any possible system state by superposing such eigenstates, such basis states, 
the concept of a system state is otherwise independent from that of a basis 
state, or eigenstate. The system state could be just a single eigenstate, but 
it need not be. Therefore, the system need not be in an eigenstate of the 
operator of interest—nor, indeed, of any Hermitian operator. 


4.4.3 Representational Freedom 


A crucial point should, by now, be clear: there is only one quantum state 
of the system,!? and this quantum state determines the probability distri- 
butions for all of the relevant observables. Changing representations does 
not change the state. In the example of Section 4.4.2, the Yns are eigen- 
states of H but not of ĉ, and that fact is not altered merely by changing 
representations. Restated, each 7; remains an eigenstate of A , even if we 
choose to write it as a superposition of the eigenstates of ĉ. 

We may choose to work with the system state, which embodies infor- 
mation on the system’s various observables, in various representations. As 
an abstract entity, however, the state itself is representation-free—a codifi- 
cation of information about the quantum system that exists independently 
of any language, any representation, we choose to write it in. In Eqs. (4.27) 
and (4.28), for example, the state |Q} is written in two different representa- 
tions. But we could, of course, simply use the ket |Q), a representation-free 
object, to symbolize the state. The situation is similar for scalar products: 
although they are valid in various representations, the scalar product itself 
is representation-free. 

As a crude but illustrative analogy, imagine discussing a dog with either 
an American or a Frenchman. In one case we use the word dog; in the other, 
the word chien. Either word is perfectly legitimate, although one or the 
other is more convenient, depending on the audience. However, the actual 
animal, a physical entity, obviously exists quite apart from the word we 
use to represent it; in fact, it exists even if we choose not to represent it in 
words at all. 

Dirac notation capitalizes on the representational freedom of quantum 
mechanics; one way to see this is through expectation values. In Eq. (4.26), 


13 There are infinite possible system states, but only one in any particular instance. 
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I calculated the expectation value (G) = (U|G|V) by expanding the system 
state in the eigenstates of Â, the operator of interest. This is the usual 
way to calculate an expectation value, but the expression (¥|Ĝ|Y) doesn’t 
instruct us to do this. Like states, scalar products are not tied to any 
representation, and (¥|G|W) simply instructs us, without reference to a 
particular representation, to take the scalar product of (| and G|W). 

How could one calculate (|G|W) in a representation (basis) other than 
that formed by the eigenstates of G? We could write out |W) in this other, 
“new” basis, and then transform this expansion into the G representation, 
essentially reproducing Eq. (4.26). Alternatively, by determining the action 
of G on each of the new basis states, we could work directly in the new repre- 
sentation (we will revisit operators in alternate representations in Chapters 
5 and 6). As an abstract entity, however, (Y|G|V) exists and is unchanged 
regardless of what representation we choose to work in—regardless, in fact, 
of whether we choose any representation at all. The bra-ket expression 
(Y|G|V) symbolizes (G) without commitment to any representation. 

Students typically first see expectation values in the context of wave- 
functions. Given a wavefunction U(x), and an operator A, the expectation 
value (A) is 


(U|A|v) = J Š U(x) ÂY (x)dz. (4.32) 


— 00O 


It’s tempting to regard Eq. (4.32) as the definition of (A) for wavefunctions. 
But that’s not quite right; (A) is the scalar product of (Y| with A|), and 
(as for (V|G|W) above) that says nothing about representation. Although 
the integral in Eq. (4.32) is the conventional expression for (A), it implic- 
itly selects the x (position) representation. The bra-ket object (W|A|W), 
however, provides a representation-free expression of (A).!4 

In Section 3.1.2, we considered (x), that is, the case A — ĉ. There we 
found that, because ĉ = x for wavefunctions (in the x representation), (£) 
has a simple interpretation in terms of probabilities. This simplicity is lost 
if A cannot be treated simply as a variable (like #). Nevertheless, Eq. (4.32) 
still holds: the form of A will vary with the representation in such a way 
that the expectation value is preserved. 

Note that our discussion has been in keeping with the vector analogy: 
the scalar product of two vectors is independent of our choice of coordi- 
nate system. This, however, does not imply that all coordinate systems are 
equally easy to work in, and it is often well worth choosing wisely. Similarly, 
the scalar product of two states is independent of our choice of represen- 
tation. But, also as for vectors, our choice of representation can affect the 
ease of calculation. 


14 Of course, to actually calculate (A) we must choose a particular representation to 
work in, and the x representation is one legitimate choice. 
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Finally, as abstract mathematical objects, both vectors and their scalar 
products exist regardless of our choice of coordinate system—indeed, they 
exist even if we choose no coordinate system at all. The same may be 
said for quantum states and their scalar products: they exist regardless 
of our choice of representation—indeed, they exist even if we choose no 
representation at all. 


4.5 Problems 


1. Suppose that |A;), |Az), and |A3), on the one hand, and |B;), |Bo), 
and |B3), on the other, form two different complete sets of (orthonor- 
mal) states of some quantum system. Suppose, also, that F is a 
Hermitian operator corresponding to some observable quantity F, and 
that F satisfies 


F\Ax) = frl Ar). 


(i) What possible values could result if a measurement of F is made 
on an arbitrary state of the quantum system? 
(ii) Suppose that the system state is |) = %5|A1)+ Az) + TE |A3). 
(a) What condition must a? + 6? + 7? obey? 
(b) What is the probability distribution for measurements of F 
in the state |W)? (Provide the possible measured values and the 
corresponding probabilities. ) 
(iii) Now suppose that the system state is |®) = |B2). What is the 
probability distribution for measurements of F in the state |®)? (You 
can’t get a number; just carry it out as far as you can.) 

2. In this problem, {|@;)} and {|3;)} are sets of orthonormal basis states, 
while c1, c3, and € are constants (numbers), and 


N 
lbi) = X bjxldr),  Âlpn) = anla). 
k=1 


In the following, Latin indices take on all possible values, while a 
numeric index, such as in |w3), takes on that particular value. 
(i) What sort of object (scalar, state, etc.) is: 


G(3|3) (4 ABs) las)? 
(ii) Simplify [ as much as possible: 


T= (Bol cr (bal EÂ Ibs) (bj lb%)(dilm) 80) |Gs). 


(iii) What sort of object is r? 
(iv) Simplify = as much as possible: 


E= (Bol c3 (2| EÂ Ibs) (djl) (mlx) 181) |Ga). 


5 
Operators 


In Newton’s theory reality was determined by a material point in space and time; 
in Maxwell’s theory, by the field in space and time. In quantum mechanics it is 
not so easily seen. 

The concepts, which [Newton] created, are even today still guiding our thinking in 
physics, although we now know that they will have to be replaced by others farther 
removed from the sphere of immediate experience, if we aim at a profounder 
understanding of relationships. 

Albert Einstein! 


There is much yet to be done in quantum mechanics. Calculating 
time-independent states and attendant energy eigenvalues is the problem 
that introductory students are typically most familiar with. But quantum 
mechanics is far richer than just these problems. If the state of our quan- 
tum system evolves in time, we will likely want to calculate that time 
evolution. We may want to predict measurement results for observables 
other than energy. We have already seen that the mathematical objects 
corresponding to the physical process of measuring an observable are oper- 
ators. Other operators correspond to non-measurement processes, such as 
time evolution. 

Operators play a central role in quantum mechanics. The goal in this 
chapter is not to treat the wide spectrum of quantum-mechanical operators 
in its entirety, but rather to discuss the most important classes of operators, 
and to present a broad overview of why and how operators are used. 

We might first ask why operators assume a central role in quantum 
mechanics, but not in classical mechanics. Consider again the fundamental 
entities in the two theories. In classical mechanics we specify the state of a 
particle by its position and momentum. From the knowledge of these phys- 
ical quantities as a function of time we may calculate, by simply applying 
the relevant definitions, the values of other dynamical quantities. 

In quantum mechanics the fundamental entity is the quantum state, 
a probability amplitude distribution, which in general does not uniquely 
determine any dynamical quantities. The connection to physically observ- 
able predictions is through the measurement algorithms as developed in the 
postulates. To investigate the behavior of a quantum-mechanical system, 


1 A. Einstein, in Schilpp (1949), pp. 81, 83, 31, 33. 
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then, we must investigate the behavior of the quantum state. Let me try to 
clarify this statement. 

Suppose we are interested in how the behavior of some physical system, 
for example, a charged body in an electric field, might be changed if it were 
rotated. In classical physics, we calculate the quantities of interest, such as 
the object’s trajectory, directly (though perhaps not easily!). We can also 
try to envision how the field might interact with the object after rotation. 

In quantum mechanics, we cannot calculate the properties of the object 
directly. We must consider the quantum state as the fundamental entity 
in our calculations, and determine the effects of the rotation on the state, 
rather than the object itself. Only then can we extract (probabilistic) physi- 
cal predictions. While we may try to press our classical intuition into service 
to imagine what will occur, that intuition is often unreliable in quantum 
mechanics. We should thus not think in terms of what happens to “the 
object”, but rather we should think in terms of, and calculate, what happens 
to the quantum state. 

In other words, in quantum mechanics, physical processes—such as 
measurement, time evolution, or rotation—are “done to” the quantum 
state. And the way that something is done to the quantum state is with 
mathematical operations, that is, with operators. 

Nevertheless, thinking in terms of particles—physical objects— 
undergoing processes can be useful and intuitively palatable, and physi- 
cists very often do just that. But there are times when we can get into 
trouble by employing such pictures. You should always realize that the 
fundamental entity in quantum mechanics is not a continuously existing 
particle (or ensemble of particles), but the quantum state. 

Please note that none of the foregoing means that real, continuously 
existing physical particles cannot exist. It simply means that such particles, 
if they exist, play no essential role in quantum mechanics. 


5.1 Introductory Comments 


As is usual in quantum mechanics, the operators that we will deal with 
are all linear operators. If Ê is a linear operator, |) and |T} are arbitrary 
states, and a and b are (possibly complex) constants, then linearity means 
L must satisfy: 


i (aly) + olr) ) = ab |¥) + bE IP). (5.1) 


In words, linearity means that an operator has no effect on constants (num- 
bers), and that its action on states obeys the distributive property. Simple 
though it is, linearity is of great importance: it determines the basic rules 
by which an operator may be manipulated. With practice, applying these 
rules becomes second nature. 
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Before considering specific classes of quantum-mechanical operators, 
let’s try to get a broad overview of the subject. First, consider a fact that is 
fundamentally important, but often not obvious: guantum-mechanical oper- 
ators must “act” on something—in particular, they act on either states or 
other operators. 

For example, the commutator of two operators A and B is defined as,” 


[A, B] = AB- BA. (5.2) 


Commutators appear frequently in quantum mechanics. To actually cal- 
culate them, however, it is often essential to realize that the operators 
involved don’t stand on their own; they must always act on a state. For 
example, suppose the operator Ais simply the variable x, and B is the dif- 
ferential operator d/dx. The interpretation of [A,B] may be unclear, but 
now assume that the operators act on a state (wavefunction) f(x). Writing 
out [A, B] f(x) explicitly, we have, 


(AB - BA) = (05 - $2) f=08 df (s+) - f #0. (5.3) 


dx 


Clearly, the failure to realize that operators act on states could, in such a 
case, lead to confusion. 

A useful way to think about the action of operators on states is that 
operators have the effect of changing a quantum state into another quantum 
state. An obvious exception to this is the identity operator, Î , which does 
what you might suspect. For any state |W) the action of Î is given by: 


i}W) = |W). (5.4) 


As a simple example of an operator changing a state, consider the action 
of an operator A on its eigenstates. If the eigenvalue equation for Ais 


A|W;) = a;|¥;), (5.5) 


then the effect of A on the state |W,) is simply to multiply |V,;) by the eigen- 
value a; (a number). We may legitimately think of a;|V,;) as a new state, 
and thus give it a new name. For example, we could say that a,|¥;) = |Q,). 
Nevertheless, |¥;) and |Q;) are so similar that we usually don’t introduce 
new state labels in such a case. Of course, if A were to act on a non- 
eigenstate, the new state would not be so simply related to the original 
state. 


2 We will study commutators in Chapter 7. 
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5.2 Hermitian Operators 
5.2.1 Adjoint Operators 


In an equation such as p|løk) = pplk), the operator p clearly acts to the 
right, on the state |¢,). It seems natural to make a similar assumption 
for scalar products—that is, that (|A|d,) really means (©|(Al¢x)), the 
curved brackets indicating that A acts on the state to its right, on the 
ket. But the bra, (|, is also a state, and so could also be acted on by an 
operator. You may suspect, then, that we will routinely need to deal with 
operators acting on bra states, but in practice such occasions are rare. There 
is, however, an important type of operator—the adjoint operator—that is 
defined by its action on bras, and to which we now turn our attention. 

An operator acting on a state creates another state. PI denote by |&) 
the new state created by A acting on |W); that is, A|W) = |£). From Section 
4.2.1, the correspondence between bras and kets is defined by 


I9) = X e165) — se (ġ;l: (5.6) 


j 


For an arbitrary scalar product, such as (U|Aldx), we do not expect, in 
general, that (W|(Aldx)) = ((VJA) lx) = (AV|@,) will hold. For example, 
for two wavefunctions a(x) and B(x), the equation 


fro [As(@)] ax = f [Ara (2)] Bde, (5.7) 


will in general not hold. Note that in Eq. (5.7) the operator acts only on 
the state within the braces, and that it is conjugated when acting to the 
left. 

Suppose, however, that there exists an operator At whose action to the 
left, on an arbitrary bra (W|, produces a new bra that corresponds to the 
ket produced by A acting to the right, on the ket |W). That is, 


Alt) = |é) — (UAT = (1. (5.8) 


The operator At is called the adjoint of A. It may also be defined in terms 
of scalar products: Operating to the left with A‘ in an arbitrary scalar 
product produces the same result as operating to the right with A. 


(U| (Aloe) = (UIA) lor). (5.9) 
Note that Eq. (5.9) holds even though A and Ât act on different states. 


Equation (5.8) therefore seems more transparent; nevertheless, Eq. (5.9) is 
a direct consequence of Eq. (5.8). 


3 Suppose Ê = d/dz; writing g(x)B f(x) clearly means to differentiate f(x), not g(x). 
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Unfortunately, some potentially confusing notational variations exist 
regarding operators generally, and adjoint operators in particular; the 
choice of which to use is largely a matter of taste. Some authors may 
write: (a|A|@) = (a|A@), or (a|At|@) = (Aal@). Others write (a|At|3) = 
(Ata). Finally, as in this book, some authors avoid writing operators 
inside of bras or kets (an exception occurs just before Eq. (5.7)).4 While 
this is less explicit than the alternatives, it is arguably also less confusing 
and ambiguous. 


5.2.2 Hermitian Operators: Definition and Properties 


In general, the adjoint of an operator, such as At, is different than the orig- 
inal operator, A. It could happen, however, that At = A, that is, A could 
be equal to its adjoint. Such operators are called self-adjoint or Hermitian. 
Clearly, then, if an operator Ais self-adjoint, the following holds: 


(U| (Aldi) = (PÂ) ldk) = (UIA) lbk). (5.10) 


By its very definition, then, we see an important property of a Hermitian 
operator: when a Hermitian operator appears in a scalar product it may act 
on either the bra or the ket without changing the result of the calculation. 
Thus if a scalar product involves a Hermitian operator, the curved brackets 
and dagger notation become superfluous and are usually omitted. 

The expectation value of any Hermitian operator A, in any quantum 
state, evidently satisfies: 


(A) = (X| (Âl¥)) = ((UIA)|%), (5.11) 
or, in terms of wavefunctions, 


b b 
(A= | w(@)Av(e)ae = | weir (aac. (5.12) 


From Eqs. (5.11) and (5.12) we see that the expectation value of A equals 
its complex conjugate, or, in bra-ket form, 


(VJÂ|T) = (XÂY). (5.13) 


Now, (A) is a number, and if a number is equal to its complex conjugate, 
it must be real. Thus, (A) must be real for any state. It is a short step to 
show that all eigenvalues of A are real. 


4 An example of the first approach appears in Shankar (1980), pp. 28-29; the second 
approach in Liboff (2003), p. 105; the third approach in Sakurai (1994), Section 1.2. 

5 Equations (5.11) and (5.12) apparently require less stringent conditions on an oper- 
ator than its being Hermitian, since there the states that appear are some ket and the 
bra that correponds to that same ket, while the adjoint is defined between any bra and 
any ket. Ballentine (1998), Section 1.3, addresses this apparent discrepancy. 
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What are the physical implications of Hermitian operators? Postulate 2 
tells us that (1) physically observable quantities are represented by Hermi- 
tian operators, and that (2) the eigenvalues of such operators correspond 
to the possible values of measurements. The conventional (though not 
necessarily convincing) argument is, then, as follows. Observables must 
be represented by Hermitian operators because the eigenvalues of Her- 
mitian operators are real, and physical measurements always yield real 
numbers—never imaginary or complex numbers. 

I now list without proof three key properties of Hermitian 
operators.” 


e The eigenvalues of a Hermitian operator are real. 

e The eigenstates of a Hermitian operator are orthogonal. 

e For finite-dimensional spaces, the eigenstates of a Hermitian operator 
form a complete set. 


The third property implies an important result: because the eigenstates of 
a Hermitian operator form a complete set, any state may be expanded in 
a linear combination of the eigenstates of any Hermitian operator in that 
space. For infinite-dimensional spaces the third property presents subtle 
mathematical problems. Nevertheless, the result of a rigorous analysis is 
that the eigenstates of a Hermitian operator form a complete set even in 
such spaces. 

A very useful example is the calculation of an expectation value, 
such as (W|p|W). I will proceed very deliberately, so as to illustrate the 
fundamentally important principles that appear in this simple calculation. 

Because P is an observable, p must be a Hermitian operator; we take 
its eigenstates to be the |¢,)s. Let’s say that, in the P representation, 
|X) = cilo1) + cald2). Then (V| = cf (o1| + c3(¢2], and: 


(U|plv) = (U|(p|v)) = (alel + ¢(¢2|) (B{ e161) + c2|¢2) }). (5.14) 


In the first equality the brackets indicate explicitly that we will operate 
to the right. In the second equality we have expanded (W|p|¥) in the 
|b,)s—the eigenstates of p. By so doing, the action of p becomes trivial: it 
simply returns the corresponding eigenvalues. 


(alel + (del) (B{ e161) + €2|¢2) }) 
= (cf (d1| + c3 (421) (prerld1) + p2ealda)). (5.15) 


Because the |¢,)s are eigenfunctions of a Hermitian operator they are 
orthogonal. Moreover, we almost always normalize quantum states, so we 


6 Ballentine (1998), Section 2.2. 
T Proofs of these properties are readily available in textbooks. The second property 
must be modified for degenerate eigenvalues, but I will not discuss these. 
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take the |¢;)s to be orthonormal. Thus, the scalar product of two such 
states obeys (¢;|¢x) = jk. Completing the calculation, then, we have: 


piler|?(b1]¢1) + phc lhl) + pact c2(¢1|b2) + palea|? (¢2| 62) 
= piler|? +0 +0 + palea|?, (5.16) 


which is the two possible measurement values, each weighted by the prob- 
ability of obtaining that value—in accord with our expectations from 
Chapter 3. 8 

Simple though it is, this example embodies fundamentally important 
concepts and principles, such as change of representation, the action of an 
operator on its eigenstates, and orthonormality. Be sure you fully under- 
stand what was done in this example, and why! Failure to do so will likely 
haunt you later. 


5.2.3 Wavefunctions and Hermitian Operators 


The time has finally come to introduce some explicit forms for operators. 
Perhaps the most basic quantities in classical physics are position and linear 
momentum—corresponding to these in quantum mechanics are the position 
and momentum operators. The “derivation” of such operators, to the extent 
that they can be derived, is in general a difficult task.? I won’t derive these 
fundamentally important operators here, but I will introduce and discuss 
them. 

Suppose a one-dimensional system is represented by a position-space 
wavefunction, U(a,t). The basis states are the eigenstates of position, and 
as a result, the position operator in this case simply returns the correspond- 
ing eigenvalues, so that ĉ = x (see Section 2.3.2). Because the position 
eigenstates are not eigenstates of momentum, the momentum operator p, 
is not so simple, taking the form p, = —ih 0/Oz or, in three dimensions, 
p= —ihV. 

What if we had chosen to work in the momentum representation? Our 
wavefunction would then be a function of py and t, say ®(p,,t), and the 
basis states would be the eigenstates of p,,.!° In the position representation, 
we replaced ĉ with its eigenvalues, since it was acting on its eigenstates. In 
the momentum representation, the momentum operator may be replaced 
by its eigenvalues: jp, = pz, since Py now acts on its eigenstates. Corre- 
spondingly, it is now the position operator that is no longer simple, taking 


8 Since p is Hermitian, we would have obtained the same result if instead we had 
chosen to operate to the left with p (try it). 

9 Rigorous derivations are available, such as: Ballentine (1998), Chapter 3 and 
Section 4.1; Jordan (1975). A nice “plausibility” derivation of the linear momentum 
operator, which manages to avoid group theory while nonetheless being based on similar 
physical considerations, is given in Landau (1958), pp. 38-39. 

10 (pz, t) is the same state as (x,t), but in a different representation. It is not, 
however, the same function as U(x,t). This is discussed in more detail in Chapter 12. 
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the form: = ih 0/Op, (or in three dimensions, iAV, where the derivatives 
are taken with respect to pz, py, and pz). 

Other operators, such as those corresponding to angular momentum and 
energy (discussed in Chapters 8 and 9, respectively), may be constructed 
from the position and momentum operators, along with the corresponding 
classical quantity. For example, if we describe a system using Cartesian 
coordinates, and if the potential energy depends only on position, then the 
classical Hamiltonian of a system is simply its total energy: 


2 

PAR Vee ve) Ss 2 VG), (5.17) 
2 2m 

where K and V(x) are the kinetic and potential energies, respectively, 

m is the mass and, for simplicity, we work in only one dimension. We 

may construct the Hamiltonian operator for the corresponding quantum- 

mechanical system through the substitutions pp — p, and x — ĉ. In 

position representation: 


rat croget (n) iE (=) = +V(2).(5.18) 


Because in position representation < = x, the substitution x — ĉ changes 
nothing. This “quantization” procedure works also in momentum represen- 
tation, but because the explicit forms of ĉ and py are then different, so too 
will H take on a different form. We may now write the time-dependent 
Schrödinger equation in its general form, as in Postulate 1, and in explicit 
form for our one-dimensional system, in position representation:!! 


in? wle,t) = HW(x,t) = (=) Z tlet) +V(a)U(x,t). (5.19) 


Finally, note that because H is a Hermitian operator (corresponding to 
the total energy), there is an associated eigenvalue equation: 


AW,,(a,t) = (=) Z vnet) +V(2)¥,(2,t) = E,V,(2,t), (5.20) 


where W,,(z,t) is the nth eigenstate of H, and En, is the nth energy 
eigenvalue. 1? 


11 Tn essence, Postulate 1 takes the state to depend only on t, so the time deriva- 
tive cannot be a partial. In Eq. (5.19), however, the state depends on t and on gx, so 
differentiation with respect to t must be indicated with a partial derivative. This sort 
of notational subtlety occurs frequently when dealing with wavefunctions, for example, 
when using Ê and p. 

12 We will revisit Eq. (5.20), the time-independent Schrodinger equation, in Chapter 9. 
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5.3 Projection and Identity Operators 
5.3.1 Projection Operators 


We have now seen that superposition is pervasive in quantum mechanics. 
Let’s again write down a generic superposition: 


Ib) = X glg). (5.21) 
j 
Suppose Eq. (5.21) holds (i.e. |y) can be expanded in the |&,)s) even if 
|) is allowed to be any legitimate quantum state of the system. Then 
Eq. (5.21) constitutes a statement of completeness. That is, because any 
legitimate state can be expanded over the |g;)s, the |€;)s must constitute 
a complete set of basis states.!3 In words, Eq. (5.21) says that to expand 
some arbitrary |) in terms of the |€;)s, simply weight each |f;) properly 
and add them all up. 
If we take the |€;)s to be orthonormal, the proper weighting is 
just (€|~), the projection of |y} onto each |g;). Let’s now write our 
superposition out in a somewhat different form: 


l) = DJ olé = X Elo) = IE (Esl). (5.22) 
J j 
Here I’ve used the fact that the scalar product, (€;|q), is just a number 
(it’s just c;), and may be treated as such. 
For a single term, such as the j = 5 term, we can write: 


eslés) = (Esl) |és) = [és) (Eslw) = (Iés)(ésl)I¥)- (5-23) 


Bracketing |€5)(&5| doesn’t really “do” anything other than suggest we view 
\f5)(€5| as an object in its own right. But what is such an object? Evidently 
it’s not just a number. The role of a bra is to meet up with a ket and form 
a scalar product. And the role of an operator is to act on a state to form 
another state. Thus, |&5)(&5| plays the role of an operator: if it meets up 
with a ket on the right, say |a), the result is the new state |£5)(&5|a) (i.e. 
it is the state |&5) weighted by the number (&5|a)). So, while we may think 
of |€5) (€5|) in Eq. (5.23) as the ket |5} weighted by the scalar product 
(€5|#), we may instead think of it as the operator |&5)(&5|, acting on the 
state |y). 

Clearly, |) (&| acting on an arbitrary state |a} yields the j = 5 term in 
the expansion of |a) over the |£;,)s; that is, it creates a new state consisting 
simply of |s} weighted by the projection of |a) onto |€5). Accordingly, we 
call |€5)(€5| a projection operator, and denote it Ps. 


13 This is not to be confused with the proposed completeness of quantum mechanics 
as a description of the world (see Chapter 3). 
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5.3.2 The Identity Operator 


By definition, the identity operator leaves unchanged any state upon which 
it acts. It would thus seem trivially simple. But in quantum mechanics the 
term identity operator often refers to much more than simply doing nothing 
to a state. 

What if we were to act on a state |y} with not just one projection 
operator, but with a new operator consisting of the sum of all projection 
operators for a given set of orthonormal basis states? Then we would simply 
regain Eq. (5.22): 


> Plt) = IENEI = I). (5.24) 


Thus, the action of the operator 7; Ê; = >; lz) {&z| is just to return the 
state upon which it acts. In precisely this sense, ` j Êj = Î , the identity 
operator. 

In another sense, however, >> j P; performs a critically important func- 
tion: it expands a state (in this case |w)) in a particular set of basis states 
(in this case, the |€;)s). By analogy, if we transform a vector in ordinary, 
three-dimensional space, three-space, into another coordinate system, the 
vector itself remains unchanged; but we’ve nonetheless done something 
very important. The result of translating War and Peace from Russian 
to English is still War and Peace, and in that sense, the translator has 
“done nothing” —but the translator will feel he’s done quite a bit! 

The identity operator, in the sense used above, is thus really a means to 
effect a change of basis, or representation. Because it changes the language 
in which the state is written, but doesn’t alter the state itself, we can insert 
[= a Ê; wherever convenient. In fact, this operator may act not only 
on states but also on other operators (discussion of which we will defer to 
later chapters). 

As a simple example of the power of “doing nothing” with the identity 
operator, suppose we wish to calculate the scalar product (3|V), but don’t 
know how to do so directly. We do, however, know how to expand |6} in 
the € basis, and |W) in the ¢ basis, and we can find the projections of the 
€ basis states onto the ¢ basis states. Then we can immediately write 


(BIE) = XO (BIE) (Elon) (bal W)- (5.25) 


j,k 


5.4 Unitary Operators 


In addition to Hermitian operators, unitary operators constitute a funda- 
mentally important class of quantum-mechanical operators.!4 To develop 


14 Closely related to unitary operators are anti-unitary operators, which act as unitary 
operators that also complex conjugate any constants. The time-reversal operator, which 
we will not further discuss, is probably the most important anti-unitary operator. 
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the concept of a unitary operator let’s first define the inverse of an operator 
Ü as another operator, U~', such that, 


uu =0 0 = Î, (5.26) 


where Î is the identity operator (now in the simpler sense of an operator 
that truly does nothing to a state).'° By definition, if the inverse of some 
operator U is equal to its adjoint, that is, if 


Ut=0t, (5.27) 
then U isa unitary operator. Thus, for such an operator, 
0-0 = tÔ =F. (5.28) 


As with any operator, unitary operators have the effect of changing 
a quantum state into another quantum state. In what follows, primes 
denote new states created by a unitary operator U acting on the states |W) 
and |Q): 


U|v) =|W’), and UIQ) =|’). (5.29) 


Moreover, by definition of an adjoint operator, (¥|Ût = (W’|, and (Q|Ut = 
(Q'|. Now, the scalar product of one state with another (normalized) state 
gives the projection of one onto the other. Let’s examine the scalar product 
of the primed states, and compare it to that of the unprimed states: 


(VIV) = (QÔ) (O]W)) = (QTD |W) = (QOY) = (QP). (5.30) 


So unitary transformations leave the scalar product unchanged, or 
invariant. 

Now let’s go back to our old friend, the geometrical vector in three- 
space. Consider the scalar product between two such vectors A and B 
in this space. What can we “do to” our system of coordinate axes and 
vectors in this case such that we obtain a result analogous to that above 
for quantum states? That is, what can we do to our coordinate axes and 
vectors A and B such that the scalar (dot) product, A-B, is unchanged? 
Suppose we identically rotate all vectors in our system (except the basis 
vectors). Clearly, this cannot alter the vectors’ lengths. Moreover, regardless 
of the new orientation of these vectors with respect to the fixed coordinate 
system, their orientation with respect to each other remains the same, and 
thus their dot product remains the same. 

So rotations in three-space preserve relations between vectors, while uni- 
tary transformations have a similar effect, in the sense that they leave scalar 
products of quantum states unchanged. Thus we may think of a unitary 
transformation as a rotation in Hilbert space—the abstract, mathematical 
space in which quantum states live. 


15 Some authors use 1 rather than Î. You should realize that a product of operators, 
by itself, can’t be a number! 


64 Operators 


Unitary transformations are used in part to investigate conservation 
principles and symmetries. To pursue such investigations in depth would 
require us to venture into the wild and woolly world of group theory, which 
we will not do.'® We can nevertheless examine some of the most important 
basics regarding unitary operators that may prove useful later. 

We have seen that the physical meaning of Hermitian operators is 
that they correspond to observables. What is the physical meaning of uni- 
tary operators? For our purposes, unitary operators effect transformations 
of quantum-mechanical systems in space and time. Such transformations 
include rotations, boosts (changes in velocity), translations in time, and 
translations in space (along an axis). Unitary transformations may be car- 
ried out on either the states or the observables (Hermitian operators) with 
equivalent results.!” 

To illustrate this, consider the behavior of the expectation value 
(Y|A|W) under a unitary transformation. Proceeding as in Eq. (5.30), we 
write, 


(W/|A]W’) = (WOT) A(T|¥)) = (v (Ôt AÔ) |W). 6.31) 


Equation (5.31) expresses the fact that we may regard the transformed 
scalar product either as the expectation value of A in the transformed state 
|W’), or as the expectation value of the transformed operator Ut AU in the 
untransformed state |W). This illustrates the fact that we may transform 
either the states, by operating with U (and U t), or the observables, with 
a transformation of the form Ut AU. 

Although for our purposes unitary operators are of secondary impor- 
tance, one such operator appears routinely in quantum mechanics, 
and plays a central role in what follows: the time-evolution operator, 
exp|—iH (t — to)/h], where H is the Hamiltonian operator, t is the time, 
and to is the initial time.'® The role of the time evolution operator is to 
propagate, or “time evolve,” quantum states from initial time to to time t, 
and therein lies its great utility. We will have much more to say about the 
time evolution operator later. 


5.5 Problems 


1. Consider a quantum system which requires only two basis states to 
span the space. For the operator G, the basis states are |G) and |G2). 


16 Unitary operators and transformations are discussed further, however, in 
Appendix D. 

17 These two equivalent methods of carrying out a unitary transformation, that is, 
transforming the states or the observables, are often referred to as active and passive 
transformations, respectively. 

18 This form of the time-evolution operator is derived by separating variables in the 
time-dependent Schrodinger equation. This requires that HI be independent of time; 
thus, this form generally holds only for such Hs (cf. Chapter 11). 
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A mythical “black box” prepares the system in the state |G2). Another 
operator, A, has eigenstates |a1) and |ag) that span the space, with 
eigenvalues a, and ag, respectively. 

The eigenstates of the Hamiltonian operator, H, are |E,) and |E3). 
These are needed to time evolve the system; Pll explain below. Your 
goal is to calculate the time-dependent probability distribution for 
measurements of A. I’m going to help you through this. 

Use bra-ket notation wherever possible, including for the expansion 
coefficients. 

(i) We will develop time evolution in a later chapter. For now, PIH 
simply provide you with what’s needed. Time evolution is carried out 
by the action of an operator U(t), where: U(t) = e~*#*/P, Tt can 
be shown that if U(t) acts on, say, the jth eigenstate of H, with 
eigenvalue E;, then the action of Û (t) is simply U(t) — e7*2st/P, 
Now expand the initial state |G2) in a basis that will allow you to 
time evolve it. 

(ii) Call the time-dependent system state |y(t)). Write down |:)(t)) 
in terms of the expansion of Part (i). Simplify |W(¢)) as much as 
possible—remember that the point is to replicate what a practicing 
physicist would actually do, that is, to obtain something that you 
could actually calculate. 

(iii) Write down eigenstate expansions that will allow you to calculate 
the probability distribution for A from |2)(t)). 

(iv) Rewrite |¢(t)) using the result of Part (iii). 

(v) Find the probability distribution for a and az. 

(vi) Write a clear and coherent summary (in words!) describing what 
you did in Parts (i) through (v), and why. 

. In Chapter 2 we first introduced the idea that observable quantities 
are represented by Hermitian operators. But it is worth examining 
the meaning of this statement. 

In what way does the mathematical process of acting on a state with 
a Hermitian operator correspond to the physical process of making 
a measurement? In what way does it fail to correspond to mak- 
ing a measurement? (Hint: you will probably need to consider the 
changes brought about by measurement, per Postulate 3, to answer 
this question.) 

. In general, the fact that the average value of some quantity is real (not 
imaginary or complex) certainly doesn’t mean that each individual 
value over which the average is taken must be real. 

If an operator is Hermitian, the expectation value for the operator is 
real for any quantum state. This statement alone is sufficient to guar- 
antee that any individual eigenvalue of the operator is real. Explain 
why this is so. 

Please refrain from reproducing proofs, which are easily found in 
texts, that the eigenvalues of Hermitian operators are real. Rather, 
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think carefully about the second sentence of this question. You 
should be able to answer the question convincingly using little or 
no mathematics. 

4. Suppose D|5) = |ô}, where |ô} is an arbitrary state. Write an equation 
indicating the action of the operator Dt on the bra state corresponding 
to |ô}. 

5. If B is a Hermitian operator, then any state may be expanded in 
the eigenstates of B. What property of Hermitian operators does this 
reflect? i 

6. Consider a unitary operator U(y) = e7*@7/". Assume that Gl) = 
gk\¢n), and take |W) to be a superposition of two eigenstates of Â, 
i.e., |Y) = cala) + cg|g)- 

(i) Determine whether the inner product (|W) is preserved under the 
indicated transformations. 

(a) (Ww) — (W|O(™)|) 

(b) (wlw) — (WOT) Iw) 

(c) (UY) — (WOO) 

(ii) It is sometimes said that “inner products are preserved (left 
unchanged) by unitary operators.” Given your results in Part (i), 
clarify/explain what is meant by this statement. 

(iii) Perhaps the most important unitary operator is the time evo- 
lution operator, denoted U(t), which advances a quantum state 
forward in time. Explain why your answers to Parts (i) and (ii) 
make sense physically, for the case of time evolution. (Why would 
you expect them to hold, based on what is being time evolved in 
Part (i)?) 

7. The operators A and B, with eigenvalues a; and (;, respectively, 
share eigenstates. The orthonormal set {|j)}, where j = 1, 2,3..., 00, 
comprises these eigenstates. Suppose the system state is |y}, and that 
|) is not an eigenstate of A or B. 

(i) Write down three equations that concisely state the information 
provided in the preceding paragraph. 

(ii) Calculate (y|Al7). Be explicit. 

(iii) Calculate (y|Ĉ|y), where C = AB. Be explicit. So far, we are 
taking operators to act only to the right. We regard C as meaning 
that first B acts, and then A. 

(iv) The states |A) and |Ay) are (different) superpositions of 
the |j)s. Write out equations that state this (and thus define the 
expansion coefficients). Then calculate explicitly (A,|A|A2) and 
(As| Alan). 
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(v) Suppose that the set {|J)}, where J runs from —2 to 3 in integer 
steps, forms a complete set of basis states for some system. Also, 


G|J) = gs|J), and (5.32) 
3/2 

w= So esl), (5.33) 
J=—3/2 


where gy = h/J, and the unnormalized cys are proportional to /| JI. 
Obtain the normalized cys, and then calculate (U|G|V). 


6 
Matrix Mechanics 


Heisenberg criticized Schrodinger’s approach... In a letter to Pauli he even wrote: 
“The more I ponder about the physical part of Schrodinger’s theory, the more dis- 
gusting it appears to me.” Schrodinger was not less outspoken about Heisenberg’s 
theory when he said: “I was discouraged, if not repelled, by what appeared to me 
a rather difficult method of transcendental algebra, defying any visualization.” 
Max Jammer! 


Built on the labor of many physicists, quantum mechanics was forged 
over many years. A lack of coherence plagued “the old quantum the- 
ory” until finally, in 1926 and 1927, quantum mechanics was put into 
a coherent form. Almost simultaneously, two separate formalisms were 
introduced: Erwin Schrédinger’s wave mechanics and Werner Heisenberg’s 
matrix mechanics. 

Wave mechanics was warmly welcomed—it was based on the familiar 
mathematics of differential equations, and seemed amenable to under- 
standable physical pictures. But Heisenberg’s matrix mechanics was highly 
abstract, and seemed to have little clear connection to a physical inter- 
pretation. The original matrix mechanics was far more opaque than that 
version which eventually emerged, and that is widely used today. Never- 
theless, even this distilled version can seem abstract and unclear, at least 
on first acquaintance—a hurdle I hope to lower in this chapter. 


6.1 Elementary Matrix Operations 
6.1.1 Vectors and Scalar Products 


Before developing the matrix formulation of quantum mechanics—matrix 
mechanics—we first briefly review some elementary definitions and oper- 
ations with vectors and matrices. You may have covered this ground in 
elementary algebra, but it’s worth presenting here because it provides a 
concise and convenient review and reference, and because it’s specifically 
intended to lay the groundwork for our subsequent development of matrix 
mechanics. 


1 Jammer (1966), p. 272. 


Elementary Matrix Operations 69 


Consider a vector V in three-space. Writing out V ina basis comprised 
of the unit vectors @, g, and %, we have: V = Vpê + Vyy + V2.2. But if 
it’s understood from the outset that we are working in the ĉ, y, 2 basis, 
it’s unnecessary to restate that fact when writing out V. In that case, V 
may be fully specified by specification of its components along each basis 
vector.2 Thus, I could write V as either a 1 x 3 (one row, three column) 
row vector, or as a 3 x 1 (three row, one column) column vector: 


V= (Va Vp V) of V= | Y |. (6.1) 


Now introduce a second vector, say U. The scalar (dot) product of 0 
and V is: 0-V = U,V, + U,V, + UzVz. We define multiplication of row 
and column vectors in consonance with the scalar product, that is,’ 


Vz 
Ü- V = (Uz,Uy,Uz)| Vy |= XO UjVj =UV; + UyVy + UzVe. 
V, j=z,y,z 


(6.2) 


The concept of a row vector, a column vector, and the scalar product 
of two vectors may be immediately generalized to an n- dimensional space, 
where n is some arbitrary positive integer. Then the row vector Ü is: U = 
(Ui, U2,...,Un), and similarly for the corresponding column vector. The 
scalar product of two n-dimensional vectors U and V is then: 0-V = 
Xj U;V;. 


6.1.2 Matrices and Matrix Multiplication 


A matriz has multiple rows and columns. In the matrix formulation of 
quantum mechanics we will be concerned exclusively with square matri- 
ces—those with an equal number of rows and columns. The elements of 
a matrix are denoted by a double subscript that indicates the row and 
column of the element, in that order. Thus, an n x n matrix M is written: 


My, Miz Min 

f Ma Mu ... Ma 
M= i Kin . ; (6.3) 

Mni Mne2 Mnn 


2 This point was also made in Chapter 4. 


Uz 
3 Note that it is not legitimate to write U-V as: ( Uy (Vz, Vy, Vz). Such an 
Uz 
expression yields, by definition, an outer product, which is a matrix (in this case, of 
dimension 3 x 3). 
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Multiplication of the n x n matrix M by an n x 1 column vector V on 
the right results in another n-dimensonal column vector, which I’ll denote 
V’. In particular, ey Mi,;V; = V{. Illustrating with the simplest case of 
m=? 


( Mı Miz Vi \  ( Maat Mee) (VY (6.4) 
Mz, Mə Vo MoV, + Mo2V2 Vz J` i 


Similarly, multiplication of the n x n matrix M by a 1 x n row vector 
V on the left results in another n-dimensonal row vector V’. Explicitly, 
ie ViMiy = Vj, so for the n = 2 case, 


My, M 
(Vi, va) ( Ms, Ma ) = (ViMii + V2Mo1, ViMi2 + V2Mo2) 


= (Vi, V2). (6.5) 


Finally, suppose we wish to multiply M on the left with another n x n 
matrix F’, on the right. The result is a new n x n matrix, which we'll denote 
G. An element of G is given by: ey MikFkj = Gij. For the n = 2 case, 


My, Miz Fa Fiz Ne, 

Mz, Mo Fo, Fo 
Muti + Mi2Fa Mirti + MieF2 \ _ ( Cu Gis (6.6) 
Mo Fy. + Mə2F21 = Mo Fie + M22 F22 G21 G22 f 


6.1.3 Vector Transformations 


Now suppose we wish to “do something to” the n-dimensional vector V. 
Perhaps the simplest example is to again set n = 2 and rotate V through an 
angle 0, as in Fig. 6.1. Let’s call the new, rotated vector V’. Careful analysis 
of Fig. 6.1 shows that V/ depends on both components of the original vector, 
Vz and Vy (and, of course, on 0), and similarly for V,; specifically: 


V} = V, cos 0 — V, sin 8, (6.7) 
V, = Va sin 6 + Vy cos 0. (6.8) 


This is a general feature of vector transformations: any one component of 
the transformed vector will in general depend on all components of the 
original vector. 

A natural means to effect such a transformation is with a matrix. To 
carry out the rotation of Fig. 6.1, for example, we may define a rotation 
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> 
x 


Fig. 6.1 Rotation of a vector V through an angle 6; the dashed lines indicate 
the projections onto the x axis. 


matrix R(0) such that 


“y+ f cos —sind Ve yore Me 
ee ( sinô  cosé ) ( Vy ) = ( Vy J` (6.9) 
Evidently matrices are well suited for carrying out such transformations. 
Before returning to quantum mechanics, I emphasize that most vectors 


and matrices in this section make no explicit reference to a basis. Rather, 
they are implicitly written with respect to the ĉ, ĝ, 2 basis. 


6.2 States as Vectors 


As for an ordinary vector, a quantum state written in a basis with discrete 
basis states is conveniently represented as rows, or columns, of coefficients. 
For an ordinary vector, these coefficients are the vector’s components, its 
projections onto the chosen basis vectors. For a quantum state, the coeffi- 
cients are the expansion coefficients, the projections of the state onto the 
chosen basis states. For both cases, it must be understood what basis we 
are working in. 

Unlike the components of ordinary vectors, the expansion coefficients 
for a quantum state are, in general, complex. The correspondence between 
bras and kets is: |b) = )7, ¢j|¢3) — (Y| = DU, cj (¢;|- In matrix mechan- 
ics a ket is represented as a column whose elements are the expansion 
coefficients, the cs, while the corresponding bra appears as a row whose 
elements are the cys. Thus, the scalar product of a two-component state 
|) with itself is 


(ole) = (eh, ot) (St) = la + lea (6.10) 


A few simple but important observations are in order. 
First, note that if U and V are two ordinary vectors, then U -V = V.U 
(cf. Eq. (6.2)). But this clearly does not hold for two quantum states, say |Y) 


72 Matrix Mechanics 


and |£}. That is, (w|€) = (Ely)" # (Ely). This shouldn’t be too surprising: 
for the continuous eigenvalue case, the scalar product of two functions f(x) 
and g(x) is 


(sls) = f "Paoi f T gla) #e)de. (6.11) 


The order of the bra and ket in a scalar product matters. 

Note also that when states are represented as vectors, the relation 
between a basis state and a superposition is particularly clear. Because 
the vector’s components are just the expansion coefficients (or their com- 
plex conjugates), then if the state is a basis state in the chosen basis, all 
components must be zero except for one of them, which will be 1. If the 
state is a superposition in the chosen basis, then at least two entries must 
be non-zero. Suppose the {¢;}s constitute an n-dimensional basis; then 
(Pilk) = ĝjk expresses orthonormality. It’s easy to see how this statement 
is manifested in matrix mechanics: 


where j and k can take on any value from 1 to n. 


6.3 Operators as Matrices 
6.3.1 An Operator in Its Eigenbasis 


In Section 6.1.3 I illustrated how matrices transform vectors. We’ve also 
seen that the way to “do something to” a quantum state is with an operator. 
Thus, if a state is written in the form of a row or column vector, the way 
we do something to it is with an operator written in the form of a matrix. 

I’ve emphasized in this chapter that one must choose a basis—whether 
formed by ordinary basis vectors or by quantum states—and work within 
that basis. As we will see, this is true for both states (vectors) and operators 
(matrices). 

As a simplest first example, suppose |¢1) and |¢2) satisfy the eigenvalue 
equation Al¢;) = a,|@;), and form a basis, the eigenbasis of A. Suppose 
we choose to work in this basis, and to let A act on its eigenstates. Then a 
2 x 2 matrix with diagonal elements a, and ag, and off-diagonal elements 
0, will evidently do the job: 


Al¢1) = a1|¢1) — ( a A ) ( ; ) =a ( ; i: (6.13) 
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and similarly for j = 2. A matrix corresponding to an operator, such as that 
in Eq. (6.13), is often called the matrix representation of the operator.* 
Far from applying only to the eigenstates of A, though, the matrix in Eq. 
(6.13) is the correct matrix representation of A for any state written in the 
@ basis. To see this, consider a state |W) that is an arbitrary superposition 
of |¢1) and |¢2), that is, |Y} = c1|¢1) + c2|¢2). The action of A on |W) is: 


Ât) = A(cild1) + c2ld2)) = arcildr) + azcald2). (6-14) 


This is precisely the effect of the matrix of Eq. (6.13): 


Ceo a (615) 


6.3.2 Matrix Elements and Alternative Bases 


I did not derive the correct matrix representation of A:I simply wrote it 
down. Similarly, I could write A in its eigenbasis as 


j= ( (ilAler) (os1Alo) 
2 ( (¢2|A]d1)  (¢2|Al¢2) | (6.16) 


where the only justification (for now) is that it works; that is, Eq. (6.16) 
reproduces the matrix of Eq. (6.15). The various (¢;|A|@x)s are the matrix 
elements of A in the ¢ basis. Clearly, because we are working in the basis 
formed by the eigenstates of A, the diagonal elements are the eigenvalues 
of A, and the off-diagonal elements are 0. 

Until now, the action of an operator on a state was dealt with by expand- 
ing the state in the operator’s eigenbasis. In Chapter 5, however, I suggested 
that an operator could be written in a basis other than that formed by its 
eigenstates. To do so, the matrix elements are written in terms of the basis 
of choice. Suppose, for example, that the |)s are not eigenstates of A. In 
the x basis, then, A becomes: 


j= ( alder) Gal Abr) 
an ( (x2|Alx1) (x2lÂlx2) ) (6.17) 


Note that this is the same operator as in Eqs. (6.15) and (6.16), but now 
expressed in the x basis. 


4 Clearly, “representation” is used here in a different sense than in Chapter 4, where 
it referred to a particular basis. 
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In this basis the off-diagonal elements will not, in general, vanish. If, 


for example, |x1) = a1|¢1) + @2|¢2), and |x2) = 1|¢1) + 92|¢2), then the 
off-diagonal element Aj is 


Ay. = (x1/A]x2) 


(ailal + a3 (21) (abile) F azßəlġ2) ) 
= 41051 + a203b2, (6.18) 


which in general does not vanish. It shouldn’t be hard to also convince 
yourself that a diagonal element, such as (x1|Âlxı), will no longer simply 
be an eigenvalue of A 

Suppose that one must deal with a state that is known in one basis, and 
an operator known in another. We may transform either the state or the 
operator to the other basis. One reason it may be convenient to choose the 
latter approach arises if we will need to repeatedly use our operator with 
different states. If we transform states, we will need to do so for each new 
state. But if we transform the operator, we need only do so once, for once 
we have obtained the operator in that basis, it works for all states written 
in that basis. 

Many textbooks supply formal derivations that the matrix representa- 
tion of an operator in an arbitrary basis takes the form of A in Eq. (6.17).5 
Instead, I will provide a qualitatively accurate explanation of why this is 
so, using A of Eq. (6.17) as an example. 

Suppose A acts on a state |u) = (xalu) x1) + (vale) |v2) to form a new 
state |w’). Writing this out in explicit matrix form, we have 


(pate pale) RB = CRB). ean 


Let’s examine the |1) component of the new state |p’): 


Qalu) = (xa lAlxa) oala) + Oca Alxe2) (val). (6.20) 


What is this equation telling us? 

Consider first the term (x1|A]y1)(x1|/). Because |y1) is not an eigen- 
state of A, the state A]y,) must be a superposition of the basis states 
lv1) and |x2). The second factor, (x1|), is the projection of the state |u) 
onto the basis state |): it’s the “amount” of |x 1) in |). Writing the first 
factor as (y1|(Aly1)) suggests interpreting that factor as the projection of 
the state Alx1) onto the basis state |y,). That is, it’s the amount of |x1) 
in A]x1). i 

In words, (x1|A|x1}(xıl|u) is the amount of |x1) in the system state |u}, 
times the amount of |y1) in A|y1), the state created by the action of A on 
lv1). Similarly, (x1|Alx2)(x2|") is the amount of |x2) in the system state 


5 See, for example, Liboff (2003), Section 11.1. 
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|u), times the amount of |y1) in Aly2), the state created by the action of 
A on |x). The sum of these is the total contribution of |y1) to the new 
system state |’), that is, it’s (yi|u’). An analogous argument applies to 
the |yv2) component of |p’). 


6.3.3 Change of Basis 


Changing representations is a fundamentally important quantum- 
mechanical operation. Moreover, by examining the matrix elements that 
effect such a transformation, we can better grasp the structure of operators 
when represented by matrices. 

TI denote as S a matrix that transforms a ket (a column vector) from 
the x basis to the ¢ basis. Consider a state |u) = (x1|)|x1) + (vale) |x2) = 
(hilali) + (b2|)|2). The transformation matrix Ô must satisfy 


Sir Sie (xala) ) ( (Qıl) ) 
= ; 6.21 
( S21 S22 ) ( (xala) (olu) ( ) 
It turns out that S must take the form 


a f (bilxi) (¢1lx2) 
a= ( (2lx1)  (balx2) i (6.22) 


But why? 
Let’s write out one coefficient of the transformed state explicitly: 


(bile) = (h1lxa) xala) + (P1lx2) (x2le)- (6.23) 


From Chapter 5 we know that the identity operator for this system may 
be written: J = |x1)(x1| + |x2)(x2|. Thus, Eq. (6.23) is simply: (¢1|u) = 
(1 |L|1). Nevertheless, it’s somewhat instructive to dissect the terms in Eq. 
(6.23). 

The term (61|x1)(xi|/) is the projection of |u} onto |x1) times the pro- 
jection of |x1) onto |1}. Restated, it’s the amount of |x) in |) (the state 
we’re transforming) times the amount of |1} in |x1). Thus, this term gives 
the amount of |¢1) in |u) due to its |x) part. Similarly, (¢1|x2)(xv2lL) 
gives the amount of |1) in |u) due to its |x2) part. The sum of these 
two contributions then gives the total amount of |¢,) in |u}. An analo- 
gous argument holds for the other component of the transformed state, 
(¢2|4). So, S in Eq. (6.22) indeed transforms from the y basis to the ¢ 
basis. 


6.3.4 Adjoint, Hermitian, and Unitary Operators 


Adjoint, Hermitian, and unitary operators assume particular forms in 
matrix mechanics. Moreover, matrix mechanics affords a means to make 
these rather abstract concepts more concrete. 
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By definition, the adjoint of an operator B is another operator Bi that, 
acting to the left, yields the bra corresponding to the ket created by B 
acting to the right (cf. Chapter 5). That is, if Bl) = |W’), then (|t = 
(w"|. In matrix mechanics, an adjoint operator, such as Bt, is the complex 
conjugate of the transpose of the original operator, B. That is, Bi, = By; 
Thus, if: 


i Bıı Bie ) St ( By, By ) 
B= Shey Bhs 2a), 6.24 
( Bi, B32 (62) 


To see why this condition indeed yields the adjoint of an operator, 
I first need to clarify the phrase ‘acting to the left’. Though commonly 
used, this terminology is a bit misleading. In matrix mechanics, saying that 
(p| Êt denotes Bt acting to the left on (y| really means that the row vector 
(y| multiplies the matrix Bi in the usual way, acting to the right. Thus, 
(|B is 


" ” 5 B* B* 
(Ét = (4, &)( pe pe 
B32 


12 
= (By t Bly, ABa + B39) = W, (6.25) 


while B|) is 
z Bıı Bie C1 Bic, + B12C2 ! 

I) ( Bo, Boe C2 Bac, + Bo2c2 W) ( ) 
which shows that the conjugate transpose of an operator does indeed yield 
the adjoint. 

Since a Hermitian operator is self-adjoint, its manifestation in matrix 


mechanics is now obvious. If Ê is Hermitian, then B= Bi, so the matrix 
elements must obey: 


Bjk = Bt, = Bý; (6.27) 


That is, a Hermitian operator must equal its transpose conjugate (its 
adjoint). 

The defining characteristic of a unitary operator Û is that its adjoint 
equals its inverse: Ut = Û-!. Thus ÛtÛ = Û-!Û = Î. An example is an 
operator that effects a change of basis, such as ĝ in Eq. (6.22). One can 
easily check that S satisfies the unitarity condition. 

In Section 6.3.3 we saw how the matrix § transforms a ket from the 
x basis to the ¢ basis. As pointed out in Chapter 5, however, we can also 
transform the basis of an operator, via a unitary transformation (more 
explicitly, a unitary-similarity transformation). If eR is an operator in the 
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x basis, then Cis the same operator in the ¢ basis, is 
Ge = $14,8 = S1G,8. (6.28) 
The last equality holds, of course, because S is unitary. 


6.4 Eigenvalue Equations 


Consider a matrix mechanics eigenvalue equation for the simplest case of 


a two-state basis 
Ai. At C1 C1 
= : 6.29 
E A22 C2 j C2 ( ) 


Note that Eq. (6.29) does indeed fulfill the definition of an eigenvalue 
equation—an operator acting on a state must yield the same state, times 
a constant. The values of a that satisfy Eq. (6.29) are the eigenvalues. If 
Aj and/or A21, as well as both cı and c2, are non-zero, then we evidently 
are not working in the eigenbasis of A. 

Suppose we transform the state, and A, into A’s eigenbasis. Denoting 
the transformed matrix elements by primes, Eq. (6.29) takes the form 


( i me )(o)=( 0 » )(op=alo): (6.30) 


for the eigenstate with eigenvalue a, and similarly for the eigenstate cor- 
responding to a2. Appropriately, transforming to an operator’s eigenbasis 
is often called diagonalizing the operator. Clearly, if we can transform our 
eigenvalue equation into the eigenbasis of A—if we can diagonalize it—then 
we can simply read off the eigenvalues. 

An algebraic route to solution of the eigenvalue problem in matrix 
mechanics is also available. An equation such as Eq. (6.29) is really a set 
(in this case, two) of algebraic equations 


(Ayre jan \(a)a(2). ay 


From algebra, we know that solutions to such a set of equations are obtained 
by setting the corresponding determinant to zero: 


=0. (6.32) 


Aji — 4a A12 
Aoi Ag2 — a 


This yields a quadratic equation which is to be solved for the eigenvalues. 
For this two-state basis, this is a simple algebraic problem, with at most 
two eigenvalues (a; and a2). Complexity increases very rapidly, however, 
as the number of basis states increases. 
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By focusing on two-state bases, our discussion in this chapter has been 
relatively simple and concrete. Although calculations may become very 
complicated for higher-dimensional bases, the fundamental ideas developed 
herein remain valid, and widely applicable. 


6.5 Problems 


1. Consider an operator Î = exp(—iGy/h), where y is a parameter. 
Assume that G]¢x) = ge|¢x), and that k runs from 1 to 2. 
Use the step-by-step instructions below to calculate, as far as is possi- 
ble, the matrix element T12 in the 8 basis, defined by Ê Gn) = On| Bn) 
(assume that B and G do not share eigenstates). 
(i) Write out T12 in the 8 basis in bra-ket notation. Don’t do any 
calculations yet. 
(ii) Construct an appropriate identity operator I (a “sum of projec- 
tors”) that will serve to convert the states in the inner product to a 
basis that will facilitate calculation. 
(iii) Insert your Î from Part (ii) into the expression of Part (i) as 
necessary to carry out the calculation. (Hint: you need to insert I 


twice.) 
(iv) Carry out the calculation as far as possible with the information 
given. . 

2. For a spin 1/2 system: a| +a) =+8|+a), where a can be x,y, or z. 


+y)= 1//2|+2) =P 


Note that the y and z bases are related by: 
i/v2| - 2). 


(i) The operator Sy, in matrix representation, and in the z basis, is 
3 0 —iħ/2 
Sy — ( ae 0 ) , 


where I have not supplied the matrix element 5,21). (The subscripts 

1 and 2 correspond to + and —, respectively.) Calculate S,(21). Be 

explicit, so what you are doing is clear. 

(ii) State (in words) what kind of equation you should get if you 

carry out the matrix-mechanical operation ô| +y) using your answer 

to Part (i). Then carry out the calculation, to show that you get this. 
3. This mega-problem is not terribly difficult, but it does have many 

pieces. 

Consider again a two-state basis. One set of basis states is comprised of 

|a+) and |a—}; another of |G.) and |G_). The basis sets are related by: 


1 i 
PoS p EE ag ) 
1 i 


|8-) = =5le+) — z5le-) 
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You’re interested in this system because your experimentalist col- 
leagues have obtained some puzzling preliminary results. They’ll need 
to run many more experiments, and calculate the predicted results for 
many different initial states. 

Using matrix mechanics, and working in the a basis, do the following. 
(i) Write out |a_) and |a_). 

(ii) Write out |G.) and |G_). 

(iii) Write out (84| and (G_|. 

(iv) Writing matrix elements as inner products, construct a matrix 
Sap that transforms states from the a basis to the 8 basis. (Regard 
the + and — states as corresponding to element subscripts 1 and 2, 
respectively. Thus, for example, S;_ occupies the 1,2 position in the 
matrix.) 

(v) Evaluate the elements of Sap, and rewrite Sop using these matrix 
elements. 

(vi) A change of basis represents a rotation in Hilbert space, and this 
corresponds to a unitary transformation. Use the defining condition 
for unitarity to determine whether Sop is unitary. 

(vii) Show that ag either is or is not Hermitian. 

(viii) The experimentalists can prepare states that are known in the 
a basis. The eigenstates of the observable that they’re interested in, 
though, are the |3)s. The operator corresponding to this observable 
is B, which satisfies 


(|b) = r|), 


where «x is a constant. Write down the matrix B, in the @ basis. 

(ix) Use matrix mechanics to calculate (7|3|w), the expectation value 
of 6, where |y) is an arbitrary state. From Part (viii), |W) is known 
in the a basis; but 3 is known in its eigenbasis. To calculate (8), 
transform the state(s) to the 8 basis. 

(x) As a conscientious researcher, you always try to verify that your 
result makes sense. Check that your result in Part (ix) is the sort of 
mathematical object that must be obtained for the expectation value 
of a Hermitian operator. 

(xi) Your colleagues are hoping to publish their results, with you as 
a co-author, in a prestigious journal, such as Physical Review Letters. 
They’re depending on you to get things right. (It’s one thing to be 
wrong; another to be wrong, in print, about a simple calculation.) To 
check your own work, you also calculate (4| B |) by another method, 
in which you transform the operator rather than the state. Carry out 
this transformation. Is the resulting operator Hermitian? Is this what 
you expect? 

(xii) Calculate (7|3|%) using the transformed operator of Part (xi). 
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(xiii) Check that the results of Parts (ix) and (xii) are the same. Argue 
on physical grounds that they must be. (Hint: what does (4| Êl) 
represent physically?) 
You now have a prediction for (3) in any state. Your experimental- 
ist colleagues can compare your predictions with their measurement 
results. 

4. Suppose the states |¢;) and |¢2) form a basis. 
(i) Working in the ¢ basis, construct a 2x 2 matrix, denoted Ê,, whose 
action on any state is to project out the |¢2) part. 
(ii) Write out a matrix equation that shows explicitly that Ê, works 
correctly on an arbitrary state. Then write out the corresponding 
equation in bra-ket notation. Explain clearly, in words, why your 
matrix P> does what you want. 

5. The states |+z) and |— z) form the z basis of a spin 1/2 system. 
Working in this basis, explicitly construct the matrix representations 
of Ê, and P_, the projection operators onto the states |+ z) and |— 2), 
respectively. Then check your results by letting each matrix act on an 
arbitrary spin 1/2 state. 

6. The Hamiltonian operator, H , corresponds to energy. For some 
quantum system, the matrix representation of H is 


Q1 0 0 
0 a 0 |, (6.33) 
0 0 Q3 


where a; = j°h?. (For dimensional consistency, you may assume j* 
has dimensions of [mass x length?]~+.) 

(i) What are the possible measured values of the energy for this 
system? 

(ii) Suppose the system is prepared in the state |W) = c1|E1)}+c2|E2)+ 
c3|E3), where the |E,)s are the eigenstates of H. Suppose, further, 


that 
(t= yi. 


What are the probabilities for obtaining each possible energy value? 
(iii) An operator A satisfies: Ala;) = a;|a;). Suppose that, in the A 
basis, (i.e., the basis formed by the |a,,)s): 


1 
1 
aa 


(a) What are (Ai |E), (A| E1), and (A| E1)? 
(b) What is the matrix element A; in the A basis? 
(c) What is Aj, in the energy representation (basis)? 


Problems 81 


7. Suppose that for some system, G\y;) = j|), where j = 1,2. 
(i) Construct the matrix representation of G in the y basis. 
(ii) Show explicitly that the matrix of Part (i) satisfies the above 
eigenvalue equation. 
(iii) Suppose that the y and 8 bases are related by: 


ly) = 5l) — — [2 162), ly2) = \[211) + + l2) 


Construct the matrix representation of G in the b basis. 
(iv) Show explicitly that if G of Part (iii) acts on one of its eigenstates, 
we get what we should. 


7 


Commutators and 
Uncertainty Relations 


Thus, the more precisely the position is determined, the less precisely the momen- 
tum is known, and conversely. In this circumstance we see a direct physical 
interpretation of the equation pq — qp = —ih. 

I believe that one can fruitfully formulate the origin of the classical “orbit” in this 
way: the “orbit” comes into being only when we observe it. 


As the statistical character of quantum theory is so closely linked to the inez- 
actness of all perceptions, one might be led to the presumption that behind the 
perceived statistical world there still hides a “real” world in which causality 
holds... One can express the true state of affairs better in this way: Because 
all experiments are subject to the laws of quantum mechanics... it follows that 
quantum mechanics establishes the final failure of causality. 

Werner Heisenberg! 


Although I introduced the statistical interpretation of quantum 
mechanics in Chapter 3, the subsequent discussion has been rather for- 
mal—physical meaning seems to have faded from view. In this chapter I 
continue the formal development of quantum mechanics, but I also return 
to its physical interpretation. 

A few basic features seem to be responsible for many of the puzzles 
of quantum mechanics. This chapter focuses on two such features: com- 
mutators and uncertainty relations. These topics are closely entwined, and 
both have been the subject of much debate regarding their import and 
interpretation. The introductory quotations—from Werner Heisenberg’s 
original 1927 paper on the uncertainty relations—illustrate the profound 
physical and philosophical conclusions that he drew from those relations 
in quantum mechanics’ infancy. The uncertainty relations? were histori- 
cally viewed as fundamentally important—even as forming the basis for 
quantum mechanics itself.? Although this view has been largely eclipsed, 


1 Heisenberg (1927). 

2 The uncertainty relations are also known as the uncertainty principle, the indeter- 
minacy relations, or the indeterminacy principle. 

3 See, for example, Jammer (1974), Chapter 3. 
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the uncertainty relations remain an important element in the structure of 
quantum mechanics. 

I first develop the commutator, upon which the general form of the 
uncertainty relations is built. I then discuss the uncertainty relations them- 
selves, including the interpretation of them that I regard as proper, and that 
is in harmony with the statistical interpretation of quantum mechanics. 

More than usual, the discussion will take on historical overtones. The 
history of quantum mechanics is deeply fascinating, and deserving of study 
in its own right. My motivation, though, is more pragmatic: few topics 
in quantum mechanics have been subject to so many different and often 
fiercely defended interpretations as have the uncertainty relations. In par- 
ticular, discussions found in early works may seem unclear, and unrelated 
to—perhaps even irreconcilable with—the modern approach that I adopt. 
Thus, it is worth contrasting this approach with other important per- 
spectives that have been advanced since Heisenberg first introduced the 
uncertainty relations in 1927. 


7.1 The Commutator 
7.1.1 Definition and Characteristics 


Before one can carry out calculations within some mathematical frame- 
work, the fundamental objects that comprise that framework must be 
defined, and the rules for manipulating those objects established. In ele- 
mentary algebra, for example, we deal with numbers and functions. Two 
fundamental algebraic rules concern the multiplication of numbers and of 
functions. For two numbers a and b, and for two functions f(x) and g(x), 
multiplication is commutative: 

ab=ba, and f(x)g(x) = 9(2)f(2), (7.1) 
that is, the order of multiplication doesn’t matter. With a bit of practice, 
the commutativity of both numbers and operators with respect to multi- 
plication becomes “obvious”, and one applies the rule without giving it any 
thought. 

In quantum mechanics, the objects that we deal with are numbers, 
states, and operators. Perhaps the most fundamental rule in quantum 
mechanics (apart from the fact that the ordinary rules of algebra apply 
to numbers and functions) is that operators are linear. This means that an 
operator, say Ĝ, obeys: 


G (alp) + blo) = aG) + bG|¢). (7.2) 


It would be nice if we also had at hand a simple, universal rule for the 
“multiplication” of operators, that is, for the manipulation of expressions 
such as AB relative to BA. Alas, no such rule exists. 
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To characterize the commutativity, or lack thereof, of two objects a and 
B, we define their commutator, |a, 8], as 


[a, 6] = ap = Ba. (7.3) 


From Eq. (7.1), if œ and 8 are numbers, or functions, their commutator 
vanishes: they commute. The situation is far more subtle for two operators, 
say F and G. The lack of a universal rule as to the commutativity of 
operators is manifested in the fact that whether Ê and Ĝ commute, that 
is, whether [Ê : â] = = 0, depends on the operators in question. 

In introductory treatments, the only commutator discussed may be 
that between the one-dimensional position and momentum operators 
(in position representation), ĉ = x and p, = —ih d/dx, respectively. 
Specifically 


[$, Po] = Px — Pet = th. (7.4) 


This limitation is in part understandable: position and momentum are the 
observables that seem most intuitively palatable. Moreover, they were the 
first that the founders of quantum mechanics had to deal with, and their 
non-commutativity—the “qp — pq swindle,” as Schrodinger called it—was 
a great puzzlement. 

But in implicitly setting [#, s] as the prototypical commutator, a seri- 
ous misconception may be introduced. That misconception stems from a 
fundamental characteristic of commutators. 

Consider [F', G] = FG — GF. What are the objects FG and GF? Con- 
ow FG |W). This expression says: first carry out the instructions specified 

G (on |w)), and then carry out those specified by F. But we could com- 
ee these into a single instruction set, a new operator, say Ky = = 1G. 
Similarly, GF is also an operator, say K. Clearly, then, [F’, G] is also an 
operator, here denoted &: [F,G] = K} — K_ = &. Thus, the commutator 
of two operators is, in general, another operator. The commutator of Eq. 
(7.4)—a constant—is, therefore, an exceptional case. 

When we write Ê, G] = FĜ-— ĜÊ, we’ve “evaluated” the commutator. 
But to evaluate its role in a calculation, we generally need more: we need 
to know the state it acts on. Suppose, for example, that |Ê, Ĝ] = d?/dz*; 
clearly we cannot evaluate d?/dzx? in a calculation without specifying the 
state (i.e., function) it acts on. So, a corollary to the fact that the commu- 
tator of two operators generally yields another operator is that evaluation 
of a commutator in a calculation generally requires specification of the state 
the commutator acts on. 

There are, however, exceptions: Eq. (7.4) is one, another is 
[d/dx, d? /dz?], which trivially vanishes. Moreover, it’s not hard to show 
that, if F and G share a complete set of eigenstates, [F, G| vanishes 
identically (i.e., for any state). 
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The possibilities for a commutator of two operators, then, are 


e the commutator vanishes identically, 

e the commutator is a constant, 

e the commutator is an operator; a state must in general be specified to 
evaluate the commutator in a calculation.+ 


One practical result of the preceding discussion is that if two operators 
share a complete set of eigenstates, they commute. In that case, we are free 
to change the order of operation of two such operators in a calculation, 
regardless of the state the operators may act upon. 

The commutator also enters quantum mechanics in other, more fun- 
damental ways. One is that the commutator of two operators is related 
to symmetries, conservation principles, and the group-theoretical structure 
of quantum mechanics. Because these are rather advanced topics, we will 
largely ignore them, apart from a brief discussion in Appendix D. 

Two applications of the commutator play important roles even in ele- 
mentary quantum mechanics. First, the time dependence of the expectation 
value of some operator G in the state |y(t)) is given by 


WHIG) = WOI, GIE) + WO IE), (7.5) 


where H is the Hamiltonian operator. I state this here only to point out 
that the commutator plays an important role in time-dependent quantum 
mechanics.” Second, the general formulation of the uncertainty relations, 
discussed in Section 7.2, is based on the commutator. 


7.1.2 Commutators in Matrix Mechanics 


Before leaving the commutator proper, I illustrate its manifestation in 
matrix mechanics. For a two-state basis, an operator is a 2 x 2 matrix. It’s 
easy to show that matrix multiplication is, in general, not commutative. If 
Ê and G are operators, then [F’, G] is 


Fiu Fig Gu Gi \  ( Gu Gis Fi Fiz (7.6) 
Fai Fo G21 G22 G21 G22 Fao Fo j 
In general, none of the matrix elements of [Ê i Ĝ] vanish. For example, the 


1,1 element is 
Fi2Go1 — Gi2F a1. (7.7) 


Note that the action of the matrix resulting from Eq. (7.6) is, in general, 
state dependent. An easy way to see this is to write out the matrix |Ê, G] 


4 Examples are the commutators of the angular momentum operators. See Chapter 8. 
5 Time dependence is discussed in Chapter 11. Equation (7.5) is derived in 
Section 11.3. 
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explicitly, and allow it to act (separately) on the two basis states of the 
(unspecified) basis in which we are working. 
We could, however, imagine a case such that 


fa-6R=( 5 a) (7.8) 


with a a constant. Then the action of [Ê ; Ĝ] on any state is simply multipli- 
cation of each component by a—the action of [Ê A Ĝ] is state independent. 
Thus, Eq. (7.8), like Eq. (7.4), is a special case. 

Finally, let’s consider a case where |Ê, G] = 0 identically. We know that 
this holds if Ê and G share a complete set of eigenstates. Writing |Ê, G] in 
the two operators’ mutual eigenbasis, we have 


Fı 0 Gu 0 [Gu 0 Fir U 
0 Foe 0 Go 0 G2 0 Foe 


whose action on any state is obvious. 


7.2 The Uncertainty Relations 
7.2.1 Uncertainty Products 


In Chapter 3 we saw that the term “uncertainty” may refer either to 
the uncertainty in a set of measurement results, or to the uncertainty in 
a theoretical (calculated) quantity. In this section we will be concerned 
with the latter case—specifically, the uncertainty in the probability dis- 
tributions that arise in quantum mechanics. The fundamental importance 
that attaches to the uncertainty relations, however, stems not from the 
uncertainty in any single observable, but in the product of uncertainties of 
different observables. 

The uncertainty in a probability distribution describes a characteristic 
of that distribution—roughly speaking, its width, or spread. Suppose that 
the probability distributions for two quantities are not independent—that 
they are constrained such that both distributions cannot be made arbitrar- 
ily narrow. It could be the case, for example, that if one distribution is 
narrow, then the other must be at least wide enough that the product of 
the two uncertainties obeys some lower bound. This is the essence of the 
quantum-mechanical uncertainty relations. 

Just as the first exposure to commutators is often through [ĉ, fz] = th, 
so the first exposure to the uncertainty relations is often through ArAp, > 
h/2.° This inequality, a constraint on the product of two uncertainties, 


6 In quantum mechanics, the standard deviation (uncertainty) in some quantity y is 
typically not denoted o(y), as in Chapter 3, but Ay. 
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implies that the probability distributions for x and py are somehow related. 
In fact we know that this is true: a well known mathematical technique, 
the Fourier transform, takes a state from the position representation to the 
momentum representation, and vice versa. Thus, the x representation of 
a state (wavefunction) fully determines the state’s pẹ representation. This 
is a particular case of a central theme of Chapter 4: that there is but one 
system state and, by an appropriate change of basis, we can obtain the 
probability distribution for any relevant observable. 


7.2.2 General Form of the Uncertainty Relations 


In Section 7.1.1 we found it unfortunate that [%,p,] = ih often plays the 
role of prototypical commutator. So, too, it is unfortunate—and for similar 
reasons—that AvAp, > h/2 often plays the role of prototypical uncertainty 
relation. For Heisenberg, the profound questions that swirled about the 
physical meaning of position and momentum in quantum theory in 1927 
(and that largely survive today) found focus in ArAp, > fi/2. But this 
uncertainty relation obscures fundamental characteristics of uncertainty 
relations generally. 

It was some two years after Heisenberg’s uncertainty paper appeared 
that H. P. Robertson presented what is now the best-known general form of 
the uncertainty relations.’ In this general formulation, position and momen- 
tum play no privileged role. For any two Hermitian operators A and B, 
Robertson found that 


ngs CAB 


(7.10) 
Derivations of this general form are readily available in textbooks. Our 
interest is in elucidating its facets and interpretation.® 

Because AAAB is so closely related to ({A, B]), we expect that uncer- 
tainty products and commutators will share important characteristics. 
Commutators generally result in new operators, so that calculations involv- 
ing commutators are generally state dependent. Thus uncertainty products, 
also, are generally state dependent—a fact made very clear by writing 
out the expectation value ([Â, B]) in Eq. (7.10) as (W|[A, B]|¥). Clearly, 
AxzAp, > h/2 is an exceptional case (as is [%, fe] = if for commutators). 

As we saw in Section 7.1, if two operators share a complete set of 
eigenstates, their commutator vanishes identically. Evidently, then, the 
uncertainty product in the observables corresponding to such operators 
must also vanish identically, that is, for any state. 

There is thus a close correspondence between the characteristics of com- 
mutators and those of uncertainty relations. Remember, however, that the 


T Robertson (1929). 

8 Time is a parameter in quantum mechanics, not a Hermitian operator. Because 
of this, the so-called “energy-time uncertainty relation” (see Chapter 11) cannot be 
obtained from Eq. (7.10)—a statement about Hermitian operators. 


88 Commutators and Uncertainty Relations 


uncertainty relations are inequalities. If, say, [Ê Ĝ] vanishes identically, 
then clearly AF AG > 0. This does not, however, imply that either AF = 0 
or AG = 0, since the uncertainty relation only sets a lower bound. 

But can’t we do more than determine a lower bound? Isn’t the quantum 
world better defined than this? Yes. If we specify two operators and a 
state, then we can always, in principle, calculate an uncertainty equality: 
the actual product of the standard deviations of the two observables for 
the given state. 

Consider the important case of x and p,. For every possible state, cor- 
responding x and p, probability distributions exist. For each such pair of 
distributions, in turn, a perfectly well defined uncertainty product ArAp, 
exists. Any state must satisfy the uncertainty relation AxAp, > h/2, but 
the uncertainty equality formed by the lower limit, AxAp, = h/2, is satis- 
fied only if the state is a Gaussian (a result derived in many textbooks).° 

What, then, of the general case when the uncertainty product is state 
dependent? Specification of the state then determines both the uncertainty 
relation (the inequality) and the uncertainty equality. Although Eq. (7.10) 
only yields an inequality, it does provide a clear, concise statement about 
the constraints between quantum probability distributions; typically we 
need not calculate uncertainty equalities. 


7.2.3 Interpretations 


The uncertainty relations are not simply a mathematical result, derived in 
antiseptic fashion from the formal structure of quantum mechanics. They 
are also the vehicle by which Heisenberg and others tried to reconcile the 
concepts of classical physics with the new quantum mechanics. !° 

In this section I will first discuss two physical interpretations of the 
uncertainty relations that Heisenberg advanced. I do so not because they 
are compelling, but because they and their progeny have spawned so many 
varied and often conflicting statements. They thus form an important his- 
torical and conceptual backdrop for the approach that I advocate, and that 
we then turn to: interpreting the uncertainty relations in the context of the 
statistical interpretation of quantum mechanics. 
Measurement interactions As Max Jammer has pointed out,!! in formu- 
lating the uncertainty relations Heisenberg asked two distinct but related 
questions: 


(1) Does the formalism allow for the fact that the position of a 
particle and its velocity are determinable at a given moment only 


9 That is, AxAp, > h/2 is state independent, but ArAp, = h/2 is state dependent. 

10 A highly recommended discussion of the uncertainty relations in mathematical, 
historical, and interpretive context is Hilgevoord (2001). 

11 Jammer (1974), Section 3.2. 
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with a limited degree of precision? (2) Would such imprecision, 
if admitted by the theory, be compatible with the optimum of 
accuracy obtainable in experimental measurements? 


Clearly, (1) has to do with the formalism—the mathematical structure—of 
quantum mechanics, while (2) concerns the limitations of experiments. 
Heisenberg’s answers to these questions differed accordingly. 

To address (2), he considered a gedanken-experiment (thought exper- 
iment) involving a microscope in which an electron is illuminated with a 
gamma ray (a high-energy photon). Briefly, the argument is as follows. As 
the energy of the photon is increased, the position of the electron may 
be more accurately determined (thus the use of gamma rays); however, 
higher energy also results in a greater, uncontrollable change in momen- 
tum. The gamma-ray microscope is thus one manifestation of the opening 
quote of this chapter: “the more precisely the position is determined, the 
less precisely the momentum is known, and conversely.” 

Simple as it may seem, the implications of the gamma-ray microscope 
are far from obvious. Recall the provocative statement of Heisenberg and 
Max Born in Chapter 3: “We maintain that quantum mechanics is a com- 
plete theory; its basic physical and mathematical hypotheses are not further 
susceptible of modifications.” 

Yet the gamma-ray microscope does not support completeness—in fact, 
it contradicts it. In Heisenberg’s microscope, perfectly well-defined val- 
ues of the electron’s position and momentum evidently exist; yet there is 
no experiment by which both can be perfectly known. And Heisenberg’s 
experimentally-based argument is apparently reflected in the quantum 
formalism, through the uncertainty relations. Taken at face value, then, 
the gamma-ray microscope contradicts completeness, for if the electron’s 
position and momentum are precisely defined, and exist, yet quantum 
mechanics cannot account for that fact, the theory must be incomplete.!? 

The analysis of gedanken-experiments as an approach to the uncertainty 
relations is important for another reason: such experiments formed the basis 
for intense discussions between Neils Bohr and Albert Einstein regarding 
the interpretation of quantum mechanics—most notably at the 1927 Solvay 
congress. These were no ordinary academic debates. Jammer has called 
them “one of the great scientific debates in the history of physics...a 
clash between two of the greatest minds of their time.”!% As such, they 
rightfully occupy a special place in the history of modern physics. Per- 
haps because of them, expositions of the uncertainty relations have often 
been, and sometimes still are, framed in terms of gedanken-experiments. 
Philosophical arguments aside, however, the fact is that we do not need 


12 This isn’t intended as proof of quantum mechanics’ incompleteness, but it does 
suggest that this interpretation of the uncertainty relations is open to question. 
13 Jammer (1974), p. 120. 
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to invoke physical pictures such as those utilized in the gamma-ray micro- 
scope gedanken-experiment to acquire an operational understanding of the 
uncertainty relations. 


Simultaneous measurements To answer question (1), Heisenberg took the 
Fourier transform of a Gaussian wavefunction, thus obtaining the state in 
momentum representation (see Chapter 12). Calculating the product of the 
standard deviations in x and pz, he obtained the now famous uncertainty 
relation: AvAp, > h/2. How do we interpret this result? 

The mathematical uncertainties, Ax and Ap,, presumably correspond 
physically to a spread in measurement results. Because both Aw and Ap, 
are calculated for the state at some particular time, we may interpret them 
as the spreads in x and p, for simultaneous measurements. A Gaussian 
wavefunction is mathematically simple, but the fact is that, quite typi- 
cally, at any time t a wavefunction generally yields non-trivial probability 
distributions for both x and py. 

Whether or not simultaneous measurements are in fact possible has 
been a matter of some debate. Regardless, if one insists on the complete- 
ness of quantum mechanics, things become outright weird. If quantum 
mechanics is complete, then the state must fully describe an individual 
system, say an electron. But all information about the electron must be 
contained in the state, and the state yields “blurry” distributions of x and 
Px. Thus, the uncertainty relations evidently imply constraints on the exis- 
tence of simultaneously well-defined properties of a single system—that is, 
not only are the probability distributions for x and p, indistinct, so are x 
and Px themselves. Again, this strange result arises from an insistence on 
the completeness of quantum mechanics. 

It is important to realize that even in this interpretation, one obtains an 
eigenvalue—a single, well-defined result—in any one measurement. If this 
were not true, then the third postulate (see Chapter 2) would be violated. 
Thus, the blurring of, say, x and py does not appear in any individual 
measurement, but only in an ensemble of measurements.!* 


Statistical interpretation If you’re familiar with the difficult questions 
surrounding the measurement interaction and simultaneous measurement 
approaches to the uncertainty relations, the statistical approach may seem 
like a breath of fresh air. In Chapter 3 we suggested that the statistical 
interpretation of quantum mechanics instructs us to take seriously what 
the theory does tell us, and to not take seriously what it doesn’t tell us. 
What does an uncertainty relation, such as AvAp, > h/2, tell us? 
Clearly, it tells us that the probability distributions of x and pz are not inde- 
pendent. Moreover, we expect probabilities to be reflected in an ensemble 


14 This arises from the notorious “collapse of the wavefunction”, in which measure- 
ment suddenly and mysteriously collapses the state from a blurry distribution to an 
eigenstate. 
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of measurements. So Ax and Ap,, the spreads in x and pgz, respectively, 
are manifested in a series of measurements on identically prepared sytems 
(e.g. electrons). 

As in the simultaneous measurements interpretation, Ax and Ap, apply 
to a state at a particular time. But in the statistical interpretation, prob- 
abilities refer to ensembles, so there is no reason to interpret uncertainties 
as necessarily applying to simultaneous measurements of both x and py 
on a single system. We interpret Ax and Ap, as the spreads in x and py, 
respectively, that appear in making many separate measurements on iden- 
tically prepared systems—one measurement of either x or py, not both, per 
system. 

It may seem puzzling, but even in the simultaneous measurements 
approach, probabilities are always manifest in a series of measurements, 
never in a single measurement. That is, even if we somehow simultaneously 
measure x and py, we can obtain no evidence of a probability distribu- 
tion until multiple measurements are made. The real difference between 
the simultaneous and statistical interpretations of the uncertainty relations 
lies not in what is calculated or measured, but in the meaning we attach 
to the results. 

The measurement interaction interpretation is an attempt to under- 
stand the uncertainty relations in terms of largely classical physical pic- 
tures. The simultaneous measurements interpretation is more abstract and 
less classical, but arguably reads too much into the uncertainty relations. 
The statistical interpretation offers no physical explanation for the uncer- 
tainty relations, but simply implores us to interpret them operationally, as 
we would in the laboratory. 

In Chapter 3, I argued that a chief virtue of the statistical interpretation 
of quantum mechanics is that it affords us a means to correctly think about 
quantum mechanics without becoming unnecessarily embroiled in founda- 
tional issues. Just that virtue is at work here. We take probabilities as the 
fundamental stuff of quantum mechanics, and that’s enough to correctly 
represent how quantum mechanics is used in actual practice. And we let 
the experts argue about precisely what, if anything, those probabilities tell 
us about the world beneath the level of the formalism. 


7.2.4 Reflections 


The uncertainty relations reflect the fact that quantum-mechanical proba- 
bility distributions are both fundamental and interdependent. But why is 
this so significant? 

For a classical particle—the most basic object in classical mechan- 
ics—position and momentum are both independent and well defined. The 
particle’s position does not determine its momentum (and vice versa), and 
both quantities may be specified to arbitrary precision. The uncertainty 
relations tell us, however, that quantum probability distributions cannot be 
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independent. It is, for example, impossible for a quantum state to possess 
a finite probability distribution in x and no spread in py (i.e. Ax < 00 
and Ap, = 0), or conversely, and similarly for many other pairs of observ- 
able quantities. Thus, quantum states are such that observable quantities 
cannot, in general, be specified to arbitrary precision. This is a remarkable 
result, for the quantum state (not a particle) is the most basic object in 
quantum mechanics—indeed, if one accepts completeness, it is the most 
basic object in the world. 

The uncertainty relations are in fact deep statements of ideas that I 
have emphasized from early on: the fundamentally probabilistic nature of 
quantum mechanics, and the embodiment, within the quantum state, of all 
probabilities for the system. In a sense, however, they are simply a conse- 
quence of the superposition principle—the fact that we may expand a state 
in the eigenstates of different Hermitian operators, and from these expan- 
sions obtain the relevant probabilities. Different expansions yield different 
probability distributions, yet they cannot be independent, since all are 
extracted from the same state. The uncertainty relations quantify this fact. 

The uncertainty relations have largely faded from the forefront of physi- 
cal thought—not because they are incorrect or irrelevant, but because they 
are but one manifestation of the deep, binding thread woven throughout 
quantum mechanics by the superposition principle. 

It is difficult now to imagine the intellectual fortitude that Heisenberg 
brought to bear in his pioneering work on uncertainty. Although our per- 
spective on them has evolved greatly since 1927, the uncertainty relations 
remain one of the great intellectual constructs not only of 20th century 
physics, but of human thought generally. As John Rigden wrote, 


When first-rate minds are engaged in the intellectual activity 
called physics, as was the case in February 1927 when Heisen- 
berg was struggling with the “pq — qp swindle,” it is an activity 
with no equivalent...in any intellectual arena except, possibly, 
first-rate theological thinking. These special times in physics do 
not come often, but when they do, physicists must often create 
new constructs for which neither previous experience nor previous 
thought patterns provide guidance...Soon the new ideas become 
the basis for empirical predictions and, in the process, a “sense of 
understanding” emerges. However, in the end, the basic concepts 
of physics are aloof, they remain outside our ability to convey 
their meaning. 


Rigden continues: 


With a bright, attentive person eagerly awaiting new insight, 
I do not believe I could convey the essence of the physics 
revolution... With great erudition I could talk about electrons 
passing through a double slit, about wave-particle duality, about 
the inherent difficulties of knowing where an electron is and where 
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it is going. My best efforts, however, would fail: the essence 
of quantum mechanics would remain mysterious, aloof from my 
student’s comprehension and alas, aloof from mine.!° 


Heisenberg, Einstein, Born and others ushered in the age of modern physics, 
of relativity and quantum mechanics. But so too did they usher in an age in 
which even physicists face a gap between physical prediction and physical 
understanding—a gap between the world we perceive, and the world of 
which modern physics speaks. 


7.3 Problems 


1. Suppose that, for some particular system and state, ADAE > T, 

where T is a positive number. Consider the following statement: Phys- 
ically, ADAE >T means that a single measurement of D (or E)will 
not result in a sharp value for D (or E). 
If you agree with this statement, say so, and explain how the statement 
is reflected in the above uncertainty relation. If you disagree, say so, 
and briefly explain what ADAE >T does mean in terms of physical 
measurements. 

2. The three Hermitian operators, 


a WK0 1 a h(/0 i a hf1ı 0 
8-35 (| a 8-5 (4 0 ). 8.=3( 4 a) 


represent the spin angular momentum for a spin l system. The eigen- 
states of S, are denoted | + z) and | — z), and similarly for the 
eigenstates of Si and Sz. The eigenstates of ô, and Sy, written in 
the | + z) basis, are 


|£0)= (+2) 41-4), +n = (|+2) 24-2) 


(i) The operators are all written in the same basis. What basis is it? 
How do you know? (You should be able to answer this question simply 
by inspection.) 

(ii) Find all three (non-trivial) commutators of these operators. 

(iii) Write your results explicitly in terms of Se Ss and S$, (i.e., with 
no matrices appearing). 

(iv) Evaluate the uncertainty relations AS,AS, and AS,AS, for the 


states: 
w=(1) =li) 


15 Rigden (1987). 


94 Commutators and Uncertainty Relations 


The states are written in the same basis as the operators, above.) 

v) In Part (iv) we obtained: 

a) the uncertainty relation between Sy and Sy for the state |q1), 

b) the uncertainty relation between Sy and Sy for the state |wW2) 

c) the uncertainty relation between Sy and S, for the state |q1), 
(d) the uncertainty relation between Sy and S$, for the state |72). 
For each case, argue that the uncertainty relation obtained in Part (iv) 
is consistent with what would be expected, based on the nature of the 
system state with respect to the relevant operators. 

3. Show that: |Â, [B,C] - [BIA, CJ] = (A, B]C + C1, å. 

4. Although no basis was specified for Eq. (7.6), the two basis states (col- 
umn vectors) are easily written down. Write out the matrix resulting 
from Eq. (7.6) explicitly, and allow it to act (separately) on these two 
basis states. Explain how this shows that the action of the matrix 
[Ê, G] is state dependent. 


? 


( 
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8 
Angular Momentum 


Raffiniert ist der Herrgott, aber boshaft ist er nicht. 
Albert Einstein! 


“Subtle is the Lord,” Einstein famously remarked, “but malicious He is 
not.” Some years later, he explained: “Nature hides her secret because 
of her essential loftiness, but not by means of ruse.” Nature—that is, 
physics—is, after all, largely susceptible to human comprehension, but it 
need not behave in accord with our naive hopes or expectations. 

In this chapter, I first review angular momentum in classical mechan- 
ics, and then press on into quantum-mechanical angular momentum. In 
both cases, angular momentum is of fundamental importance. In quantum 
mechanics, however, angular momentum forces us to deal with the physical 
world on an even more abstract level than we have so far. But that is the 
nature of quantum mechanics, regardless of what we might wish the world 
to be. “If God created the world,” said Einstein, “his primary concern was 
certainly not to make its understanding easy for us.”? 


8.1 Angular Momentum in Classical Mechanics 


Both mathematically and conceptually, angular momentum in quantum 
mechanics bears little resemblance to its classical counterpart. Nevertheless, 
a brief review of the classical case is worthwhile, so as to refresh old ideas, 
and perhaps dispel misconceptions, before plunging into quantum angular 
momentum. 

First, consider two vectors A and B, and their vector product, denoted 
A x B, and defined as® 


Ax B = (A,B, — A,B,)#— (ArBz — A,Bz)§ + (ArBy — AyBr)2 = Č. 
(8.1) 


Note that A x B yields another vector—thus the name vector prod- 
uct*—which we’ve called Č. It is a direct mathematical consequence of 


1 Said to Oswald Veblen in 1921. See Sayen (1985), pp. 50-51. 

2 Calaprice (2000), p. 218. 

3 In this section, the hat notation denotes a unit vector rather than an operator. 

4 4 x B is also called the cross product. Compare A. B, the scalar, or dot, product. 
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(a) z (b) yt 


= 


Fig. 8.1 (a) The classical angular momentum, L, is perpendicular to both 
r and p (here p does not lie in the xy plane); (b) the simplified case of circular 
motion in the zy plane (here 7 and p are perpendicular; L points into the 
page). 


Eq. (8.1) that |A x B| = ABsinð, where @ is the angle from A to B. 
Moreover, it’s easy to show that (Ax B)-A = (Ax B)- B = 0, so the direc- 
tion of C must be perpendicular to the plane formed by A and B. There 
are only two directions perpendicular to a plane, and the sole purpose of 
the famous “right-hand rule” is to (correctly) choose between those two 
possible directions.° 

Returning to physics, the classical angular momentum, L, is defined as 


i =rx p= (rypz T-Py)ĉ = tips T-P) + (ro Py a Pipe) es (8.2) 


Here p = mv, the linear momentum, and 7 is the position vector from the 
point about which L is calculated to the object whose angular momentum 
we are calculating (see Fig. 8.1(a)). 

Equation (8.2) is the general and complete definition of classical angular 
momentum: nothing need be added, and no restrictions need be imposed. 
This fact has important, and often overlooked, implications. 

First, we can choose any point about which to calculate L. If the object 
is rotating about some point, such as a wheel rotating on an axle, we 
usually choose that point of rotation as the point about which to calculate 
the angular momentum. (In both Figs. 8.1(a) and 8.1(b), L is calculated 
about the origin.) But the definition of L, Eq. (8.2), does not require us to 
do so. 

In addition, Eq. (8.2) places no restriction on the nature of the object’s 
motion. The object could, for example, rotate in a circle, but it need not do 
so. Whether it rotates, moves in a straight line, or describes some compli- 
cated trajectory, E remains a well-defined quantity. Regardless of the point 
about which we choose to calculate Ti, then, and regardless of the object’s 


5 Discussion of the right-hand rule can be found in most any introductory physics 
text. 
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motion, L is a perfectly well-defined quantity. Whether L is of any interest 
for a particular case is, of course, a separate question. 

Introductory discussions often focus on the magnitude of E (i.e, L = 
rpsin@), along with the right-hand rule. It should be clear from our dis- 
cussion of the vector product that this approach is simply a reformulation 
of Eq. (8.2). 

Such discussions also often focus on circular motion, with È calculated 
about the axis of rotation (as in Fig. 8.1(b)). This simplified treatment 
makes pedagogical sense, but it can lead to misconceptions such as those 
outlined above. First, in this case 7 and p are always perpendicular, so 
L=rpsin@ — L = rp. Also, 7 is taken from the axis of rotation, which 
can lead to the (false) assumption that angular momentum always deals 
with rotation about an axis, and that L must be calculated about that axis. 

Finally, because the circular motion is confined to a plane, there are 
only two possible directions for Č at all times: into the page or out of the 
page. Thus, we can represent L’s direction with + and — signs alone. In Fig. 
8.1(b), for example, L points into the page; we could assign this direction 
of La — sign. For counter-clockwise rotation, L would point out of the 
page, and we could assign this direction a + sign. For this simplified case, 
with only two possible directions for É; assigning direction through signs 
alone works fine. Still, it obscures the full vector nature of L, and it can 
instill the misconception that angular momentum can always be treated 
this simply—which it clearly can’t.’ 


8.2 Basics of Quantum Angular Momentum 
8.2.1 Operators and Commutation Relations 


Quantum-mechanical angular momentum is a subject both deep and wide, 
and our goal can only be to introduce the essentials. Section 8.1 served 
largely as a review of angular momentum in classical mechanics, but it also 
provided the tools to begin discussing quantum angular momentum. 

In a real sense, the commutation relations for angular momentum oper- 
ators form the foundation for quantum angular momentum generally. These 
commutators may be obtained through a mathematically sophisticated 
approach based on continuous transformations (see Appendix D). For our 
purposes, however, a less elegant but more straightforward approach is 
preferable. 

From Eq. (8.2), we see that the z component of the classical angu- 
lar momentum is: L; = (rzPy — TyPx). How could we construct Lz the 
quantum-mechanical operator corresponding to L,? For a one-dimensional 


6 Similarly, for one-dimensional motion (e.g. along the x axis) there are only two 
possible directions, and these may be described with + and — signs rather than vector 
notation. Clearly, this fails for multi-dimensional motion. 
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system in position representation (i.e. a = v(a,t)), we know that the posi- 
tion operator is just x itself, ¢ = x, and the momentum operator is given 
by py = thd. Exactly analogous expressions hold for 9, py, 2, and pz. 

To construct L,, we may “quantize” L,, by simply replacing the classical 
quantities in L, with the corresponding quantum-mechanical operators: 

n Z a : o o 
L; — L, = ĉfy — 9p: = in(y— — s2). (8.3) 
Ox Oy 
(Because 7 is the position vector, re and ry are replaced with ĉ and 4, 
respectively.) Ly and Ly are similarly obtained. 

Once we have the explicit form of the angular momentum operators, 
we can calculate their commutators directly. As discussed in Chapter 7, 
the commutator of two operators is, in general, another operator, not a 
constant. For Lg, Ly, and Lz, we obtain 


[Ee 2y] = ifa, ([£2,L.)=ihl,, [Êy Êa =ihh, . (8.4) 


Like any other vector, L’s components together contain all information 
about L itself. In particular, the magnitude (squared) of L is obtained from 
its components by: I? = L = 12 + L? + Lz. (Here I’ve used the common 


notation L? = L. L.) . 
The analogous quantum quantity, L?, is defined as 


P=2+12+ 7. (8.5) 


Using Eqs. (8.4) and (8.5) to calculate the commutator of Ê? with Ly, Ly, 
and L., leads to 


[L?, L] T 0, J = 1, 2,3. (8.6) 


Here x corresponds to 1, y to 2, and z to 3. 

These results provide an object lesson in the significance of commu- 
tation relations. All of the operators, the three Ls along with Î?, are 
Hermitian. Thus, the eigenstates of each forms a complete set of states, 
that is, any angular momentum state may be represented as an expansion 
of the eigenstates of either L?, or of any one of the L;s. However, because 
the commutators in Eq. (8.4) do not vanish, any two (different) Ls do not 
share a complete set of eigenstates. 

All this leads to the following conceptual picture. Suppose the system 
is in an eigenstate of one of the L;s, say Ês (i.e. È>); This state may also 
be an eigenstate of L?. It cannot, however, also be an eigenstate of either 
Li or Lə. 
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Further insight may be gained from the uncertainty relations, which, 
given Eqs. (8.4) and (8.6), we can simply write down: 


y 2 ; 


> . (8.7) 


n(L.)| 
2 


(AL,AL, and AL, AL, are analogous to the AL, AL, case.) These rela- 
tions are not just abstract mathematics: they tell us something important 
about the physics of quantum-mechanical angular momentum. 

The first uncertainty relation in Eq. (8.7) sets a lower bound of zero 
on AL?AL,j, and this result is independent of the quantum state. Like 
all uncertainty relations, this is an inequality, so it doesn’t say that 
AL?AL; = 0, or, therefore, that AL? = 0 or AL = 0; but it does tell 
us that these equations aren’t prohibited by the uncertainty relations (see 
Section 7.2.2). 

Let’s now consider AL; AL,. From the second uncertainty relation in 
Eq. (8.7), we see that the lower bound set on AL, AL, is determined by 
(Lz), an expectation value, which clearly does depend on the state. If the 
system state is such that (L,) = 0, then the uncertainty relation alone 
cannot tell us whether AL,AL, is zero or non-zero. But if the system 
state is such that (Lz) # 0, then AL, AL, > 0, so both AL, and AL, 
must be non-zero. 

Now suppose, for a particular state, we actually calculate AL, AL, 
directly from the probability distributions (i.e. we do not just use the uncer- 
tainty relation), and find that AL,AL, = 0. It should be clear that the 
only way this can happen is if the state is an eigenstate of Ly or Le or 
both. Conversely, if we directly calculate AL, AL, for some other state and 
find that AL,AL, # 0, then that state cannot be an eigenstate of either 
aoe 


8.2.2 Eigenstates and Eigenvalues 


We know from the postulates of quantum mechanics that if we make a mea- 
surement of some observable quantity, we must obtain one of the eigenvalues 
of the corresponding Hermitian operator. The three operators Ê; all corre- 
spond to observable quantities: each corresponds to angular momentum in 
some direction, in analogy with the three classical quantities Lz, Ly, and 
L, (cf. Eq. (8.2)). In fact, the orientation of our coordinate system is arbi- 
trary, so once we have the eigenvalues and eigenstates for, say, Le we have 
them for all three È; ;s. The operator L? also corresponds to an observable: 
the squared ande of the total angular momentum, corresponding to 
the classical quantity L?. 
For our purposes there’s little to be gained by working through the 
eigenvalue problems for L? and L, in detail (and such solutions are readily 
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available elsewhere). What results is the following: 


L?|l,m) = (L+ 1)h?|L, m) l=0, =, 1, Ž, 2, Ż, 3,... (8.8) 


L.\l,m) = mh|l, m) m=-—l, —I41,...,1-1,1 (8.9) 


Although we’ve seen eigenvalue equations before, we need to unpack these 
a bit. 

Recall that two operators describe angular momentum in quantum 
mechanics: Ê? and Ê,, corresponding to the classical L? and Lz, respec- 
tively. In the ket |l, m), the label l is an index, a bookkeeping tool, that 
denotes a particular eigenstate of L?. While | itself is not the eigenvalue 
of Ê? for |l, m), it’s very simply related to that eigenvalue (i.e. I(l + 1)h?). 
Similarly, m is an index that denotes a particular eigenstate of L., with 
corresponding eigenvalue mh. Note that the value of | determines the pos- 
sible values of m. Suppose, for example, that l = 3. Then m could take on 
the values -3, — Z, L, or 3. From Eq. (8.9) we see that for some particular 
l value there are 2l + 1 possible m values. 

This also suggests why the |l, m} kets have two labels. The |/,m)s are 
eigenstates of both L? and Te: but because l doesn’t uniquely determine 
m, and vice versa, both must be given to specify a particular angular 
momentum eigenstate. 


8.2.3 Raising and Lowering Operators 


Two other important operators are the raising and lowering operators, 
also called ladder operators. The raising operator, L+}, and the lowering 
operator, L_, are defined as: 


De = Le Fily, Î- = aD yc (8.10) 


When dealing with quantum-mechanical angular momentum, it’s conven- 
tional to work in the z basis, that is, the basis formed by the eigenstates 
of L,. For example, we take the state |l, m) to be an eigenstate of L., not 
of Ês or Ly (unless told otherwise). That’s why an eigenvalue equation for 
Ê., but not for L, or Le, appears in Eq. (8.9). And that’s why we have 
only two, not six, raising and lowering operators: it’s implicit that we’re 
working in the z basis. 

Now, any angular momentum state can be expanded in the eigenstates 
of L,. Thus, to determine the action of Lg and L_ on any angular momen- 
tum state, we need only determine their action on the eigenstates of Èz: 
To do so, first use Eq. (8.4) to show that [L,, L+] = +AÊ+. Then let this 
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operator equation act on an eigenstate of L., and use Eq. (8.9): 


(L-a —L4L,)|l,m) = +hL4|l,m) 
-ll m) = £hL|l,m) 


ie {isl m)} =(m+1)h {Lall,m)} (8.11) 


The curly brackets in the last line emphasize that Ê acting on the state 
i,m) creates a new state, ÎÊ+|l, m), which is an eigenstate of Ê, with 
eigenvalue (m + 1). Moreover, using Eqs. (8.6) and (8.10), we see that 
Ê? Lall, m) = L4 L?|l,m) = (1+ 1)h?|l,m). That is, L4 |l, m) is an eigen- 
state of L? with eigenvalue I(1+1)h?. Thus, up to a multiplicative constant, 
the new state must be |l, m 1) (see Eq. (8.9)). 

The monikers “raising operator,” for È, and “lowering operator,” for 
b now become clear: the two operators, acting on an eigenstate of Ties 
raise or lower, respectively, the value of m by 1. Although we won’t work 
it out, the multiplicative constant is not unity. Rather, we have: 


L4|l,m) = JIL +1) — mm1) fi [l,m +1). (8.12) 


Note that this multiplicative constant guarantees that m cannot be raised 
above, or lowered below, the limit given by Eq. (8.9), that is, |m| < L. 

It’s worthwhile introducing the angular momentum raising and lower- 
ing operators—even though we won’t need them—in part because similar 
operators appear elsewhere in quantum mechanics. In the harmonic oscil- 
lator, operators analogous to Ly and L_ raise and lower the oscillator’s 
energy (not angular momentum) eigenstate. In quantum electrodynam- 
ics—the quantum field theory of electromagnetism—the electromagnetic 
field consists of a collection of photons. Mathematically, this field may be 
represented by a collection of harmonic oscillators. The oscillator’s raising 
and lowering operators are now called creation and annihilation opera- 
tors, respectively, because they either create or annihilate quanta of the 
electromagnetic field (photons).” 


8.3 Physical Interpretation 
8.3.1 Measurements 


In quantum mechanics, one can often attain a better understanding by 
considering what actual measurements would yield. Considering mea- 
surement results for L? and L, will help illuminate quantum angular 


T For more on the harmonic oscillator and related topics, see Appendix C. 
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momentum. Moreover, we'll make contact again with the postulates of 
quantum mechanics, from Chapter 2, and the statistical interpretation, 
from Chapter 3. 

In many texts, discussions of angular momentum measurements center 
on the Stern—Gerlach (SG) machine—essentially a non-uniform mag- 
netic field. A system with electric charge and quantum angular momen- 
tum—perhaps a silver atom, as in Stern and Gerlach’s original 1922 
experiment—can possess a magnetic moment that interacts with an exter- 
nal magnetic field. If such an atom passes through an SG machine, the 
atom’s magnetic moment interacts with the magnetic field such that the 
atom (more correctly, the atom’s quantum state) is deflected in different 
directions depending on the value of the angular momentum. The atom 
then impacts a screen.? 

For present purposes, the details of SG experiments, or of any other 
means to perform angular momentum measurements, are of little interest. 
It’s enough to say that we have a device, a “black box”, that somehow 
performs measurements of quantum angular momentum along whatever 
axis we choose. 

The simplest non-trivial case is that of | = 1/2, and thus either m = 
—1/2 or m = 1/2. Suppose the system is prepared with m = 1/2, so our 
state, in the L, basis, is [5 5). From Eas. (8.8) and (8.9) we see that 


Llb =t B=). 8) 


So, for the state 5, 5), we will definitely obtain ñ/2 if we measure L,, and 
3h? /4 if we measure L?. 

What if we adjusted our black box to measure Ly rather than L}? Ly 
is a Hermitian operator, like D,, so its eigenstates must form a complete 
set. For this l = 1/2 case there are only two eigenstates. These form the 
complete set and are exactly analogous to the eigenstates of L,, so we 
denote them |4, i), and [5 aa We can write our system state, |4, 2), 
in terms of this complete set: l4, 2) =e |4, $), +c- |4, oh In addition, 
a pair of eigenvalue equations just like those in Eq. (8.13) hold. 

So, what happens when we measure L,?!° We can only say that we’ll 
obtain h/2, the eigenvalue of Ly for [5 2 with probability |c,|?, and 
—h/2, the eigenvalue of Ly for |5,—5),, with probability |c_|?. If the 
measured result is #/2, the system is left in the corresponding eigenstate, 


8 Measurable quantities, such as L? and Lz, correspond to operators, but aren’t oper- 
ators themselves; thus, I omit “hats” from them. For the same reason, uncertainties in 
measurable quantities, such as AL? and AL; in Eq. (8.7), are not written as operators. 

9 In-depth discussions of SG machines /experiments are readily found. See, for 
example, Bohm (1951), Feynman (1965), or Townsend (1992). 

10 Very similar arguments apply for Ly. 
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|? TE ; if the result is —h/2, the system is left in |4,- DA . (See Postulate 

of Chapter 2.) 

Now consider measuring L?. The fact that Ê? = L2 + i + L? suggests 
measuring, squaring, and adding Lz, Ly, and L, to obtain a measured value 
for L?. Quantum-mechanical measurements are always made on a system 
in some partias quantum state. For the case at hand, “measuring L?” 
really means “measuring L? in the state l4, D So let’s measure Lz, Ly, 
and L, on our system state, l4, 4) z) : 

But there’s a problem. If we measure, say, Ly first, the system is left in 
either |3,3) 3) a or Pa TA So the next measurement is not being made on 
the original state; l4 5 2). If we measure L, first, the state does not change, 
because l4, 4) 5) is an eigenstate of Ê.. However, Ly and Ly must still be 
measured, and the system will not be in 5, 5) for the final measurement. 
And in general, an initial state need not be an eigenstate of La, i , Or L.. 

The solution is to re-prepare the system in the state |4, D) after each 
measurement, so that all measurements are performed on that state. Yet for 
l = 1/2, none of this really matters. For any l = 1/2 state, i/2 and —ñ/2 are 
the only eigenvalues, and thus the only possible measured values, for Lz, Ly, 
and L,. For every measurement of Ly, Ly, or Lz, then, in any | = 1/2 state, 
we get L2 = L? = 12 = hi? /4; thus L? = r +L + B= = 3h? /4. 

But oi. a case where l > 1/2, say l = 1. For each operator, Te Li; 
and La, there are now three eigenstates. And unlike the | = 1/2 case, the 
squares of h, —h, and 0—the various eigenvalues—are not all equal. 

Now prepare an l = 1 state and measure Ly, re-prepare the same state 
and measure Ly, and then do the same for Lz. If the state is a superposition 
in the Lz, Ly, and L, bases, each Lz, Ly, or L; measurement can yield 
h, —h, or 0. Thus, L2, L2, and L2 can be h? or 0, but which we obtain 
in a particular measurement is determined only probabilistically, and is 
independent of previous measurements. Calculating L3 + L? + L2 from 
such a sequence of Lz, Ly, and L, measurements may or may not yield 
2h, the eigenvalue of L?. In general, we'll obtain 2h? only by averaging 
over many sets of Ly, Ly, and L, measurements. For l > 1/2, it seems we 
can’t make a single measurement of L?. 

In fact, measurements of L? are routinely performed—but by somewhat 
indirect methods. Chemists and physicists measure energy levels of atoms, 
molecules, and nuclei. These energies can be expressed in terms of the 
quantum number l, so a measurement of energy effectively constitutes a 
measurement of l, and thus of L?. My goal here has not been to argue that 
L? is unmeasurable, but to use measurement results to better understand 
quantum angular momentum—and quantum mechanics generally. 

It’s worth noting that the discussion of measurements in this section 
is consistent with, and perhaps best understood in terms of, the statis- 
tical interpretation. That is, we treat the quantum state as determining 
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the probability for obtaining a particular value upon measuring some 
observable—nothing more, and nothing less. 


8.3.2 Relating L? and L, 


Inspection of Eqs. (8.8) and (8.9) reveals something odd. We interpret Ê? as 
the operator corresponding to the square of the total angular momentum, 
while Ê, corresponds to the projection of the angular momentum onto the 
z axis. Yet even if we choose the maximum m value for a given | value, L? 
is greater than L2. That is, the magnitude of the total angular momentum 
is greater than the projection of the angular momentum onto the z axis. 

In terms of measurements, it’s simplest to see how this arises for the 
l = 1/2 case. From Section 8.3.1, measurement of Ly, Ly, or L; in any 
l = 1/2 state yields L = L} = L? = W/4, so L? = L + L + 
L? = 3h?/4. Evidently the magnitude of the total angular momentum is 
V3h/2, but the maximum projection is only h/2. How can we make sense 
of this? 

One approach is to try to think of L? as the squared magnitude of an 
ordinary vector, and of L, as the projection of that vector onto the z axis. 
Then the argument, roughly, is that there must be projections onto the x 
and y axes as well, since a measurement of Lz or Ly cannot yield zero. For 
an ordinary vector, such projections can exist only if the vector does not 
lie along the z axis—and this, in turn, implies that L2 < L?.14 

But there’s a price to be paid in this approach: quantum angular 
momentum isn’t an ordinary vector, so one can’t attain a truly correct 
understanding through this vector model, and misconceptions may be 
introduced. Let’s try another approach to understanding the fact that 
L <P. 

Given the classical quantity L? = L? + L? + L2, we're certainly free 
to define the quantum operator Ê? = Ê2 + Í? + L?. But that doesn’t 
necessarily justify attaching the same meaning to the classical L? and the 
quantum L?. Consider the physically meaningful quantities, in the sense of 
physically measurable quantities, in the classical and quantum cases. 

Classically, angular momentum—like position or energy—is a basic 
quantity and enjoys a well-defined existence. But it’s really incorrect to refer 
to the position, energy, or angular momentum of a quantum system—even 
though doing so often causes little trouble.!? In quantum mechanics, it’s 
the state that’s well defined—and the state yields only probabilities for 
observables such as angular momentum. Quantum and classical angular 
momentum, though related, simply aren’t the same—quantum angular 
momentum isn’t an ordinary vector, and we can’t assume it will behave 
as one. 


11 The uncertainty relations are often offered as further justification. 
12 It is legitimate if the system is in an eigenstate of the relevant quantity. 
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But then how does it behave? If the classical vector L points along the 
z axis, then the x and y components of L vanish identically. The analogous 
quantum case is that of an eigenstate of Lz, such as l4, 3). If we expand 


$,4) in the eigenstates of Ly, we get:' 


[a = ala et alae: (8.14) 


Quite generally, if we expand a quantum state in another basis, the basis 
states (i.e. their expansion coefficients) can’t all vanish; if they did, the 
state itself would vanish in the new basis! The coefficients of |4, 5). and 
l4, —4)s—the x “components” of the state—do not vanish, because a 
quantum state is not an ordinary vector in real space. 

However, a measurement of Ly must yield h/2 or —h/2, so L2 > 0, 
which implies that L2 < L?. Note also, from Eq. (8.5), that a measured 
value of L? satisfies: B= L? + L + I. 

A geometrical approach is helpful. The vector G in Figure 8.2(a) clearly 
has non-zero projections onto the x and y axes. Rotating G to align with 
the z axis yields GC. whose projections onto the x and y axes are clearly 
zero. This is precisely how a classical angular momentum vector would 
behave. 

Now, for each operator, Îr, Èy, and Tees there exists a complete set of 
basis states. For the simple l = 1/2 case, two orthonormal basis states 
exist for each operator—that is, for any one axis in real space, there are 
two basis states, and thus two directions in Hilbert space. Figure 8.2(b) 
illustrates this heuristically. Any l = 1/2 state, even an eigenstate of Ls, 
could be written strictly in terms of the basis states of, say, Èa. 


(b) 
|1/2,1/2) 


|1/2,1/2)x |1/2, — 1/2)x 


[1/2, - 1/2) 


Fig. 8.2 (a) The vector G, and its rotated counterpart G’, in real space; 
(b) heuristic representation of basis states of Ês and Lz in Hilbert space for 
r= 1/2, 


13 That is, the coefficients c+ and c_ of Section 8.3.1 actually satisfy c} = c- = 1/V2. 
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Thus, the argument that L? < L? because there must be a projection 
onto the x and y axes is flawed because the situation in real space differs 
from that in Hilbert space. Real space is three-dimensional, and we clearly 
cannot represent an arbitrary vector in real space by considering only one 
axis. But for quantum angular momentum, any one direction in real space 
has a complete set of states, 2l + 1 in number, associated with it. For 
l > 1/2, things aren’t quite so simple as in the discussion above, but the 
additional complication is more a matter of degree than of principle, and 
the basic concepts as developed for the l = 1/2 case remain valid. 


8.4 Orbital and Spin Angular Momentum 
8.4.1 Orbital Angular Momentum 


Classically, it’s easy to imagine an object with angular momentum due 
both to its orbit and due to its spinning. The Earth, for example, orbits 
the Sun, but also spins on its axis. Both motions contribute to the angular 
momentum, and both are defined by L=Fx D. 

Angular momentum due to an orbit and due to an actual spinning 
object can occur in quantum mechanics, also. But in quantum mechanics, 
these would both be treated as orbital angular momentum—spinning is, 
after all, just an object orbiting on its own axis. In quantum mechanics, 
spin angular momentum, or just spin, does not refer to an actually spinning 
object. The two types of quantum angular momentum, orbital and spin, 
rest on profoundly different conceptual bases. 

We didn’t (and won’t) solve the angular-momentum eigenvalue prob- 
lem in detail. Nevertheless, an outline of how it can be solved illuminates 
the difference between orbital and spin angular momentum in quantum 
mechanics. 

One approach to this eigenvalue problem is through the position-space 
representations of the operators L, (see Eq. (8.3)) and L?. Because such 
operators appear in terms of spatial derivatives, they clearly must act on 
a quantum state that is itself written in terms of spatial variables.!4 

If, using these operators, we set up and solve the eigenvalue equations 
of Eqs. (8.8) and (8.9) (which become differential equations), we obtain 
the eigenstates (in position representation) and eigenvalues of Ê, and Ê?. 
However, the half-integer values of l, and thus also of m, that appear in 
Eqs. (8.8) and (8.9) are disallowed. Why? 

Because we are working in position-space, the solutions must be con- 
tinuous functions of position. This implies the imposition of boundary 
conditions that force our solutions to be physically acceptable. But these 
conditions can be satisfied only by disallowing the half-integer values 


14 In practice, one uses the spherical coordinates r, 0, and ¢, rather than x,y, and z. 
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of l. Restated, insisting that the position-space solutions be physically 
reasonable requires us to discard the half-integer values of l. 

Note that in the approach to the angular-momentum eigenvalue prob- 
lem just outlined, we are working in position space—the concept of the 
particle’s position probability forms the essential underpinning. This, we 
will see, is in striking contrast to the situation if one adopts the following 
alternative approach to the problem. 


8.4.2 Spin Angular Momentum 


Quantum mechanics is replete with mysteries, and spin angular momentum 
(or just “spin” ) is one of the deepest. A broad-brushstroke discussion of an 
alternative way in which the angular-momentum eigenvalue problem may 
be solved sheds light on why spin is so abstract and mysterious. 

This alternative approach doesn’t rely on the Schrödinger equation 
or position representation. Recall that in Section 8.2.1 we obtained the 
angular-momentum operators, and then the commutators, from the opera- 
tors ĉ, Px, etc.—but I said that the commutators may also be obtained 
through an approach based on continuous transformations (in partic- 
ular, by studying the behavior of quantum states under rotations). 
Moreover, once they are so obtained, the eigenvalue problem can be 
solved directly from the commutation relations (how is not our concern). 
Remarkably, the half-integer values of | now are allowed—but what can 
they mean? 

The eigenvalues corresponding to half-integer / values characterize spin 
angular momentum. But how can it be that spin apparently violates our 
requirements for the position representation of angular momentum eigen- 
states? Because, unlike orbital angular momentum, quantum-mechanical 
spin angular momentum doesn’t have a position representation; it defies 
physical pictures. Yet the eigenvalues, commutators, raising and lower- 
ing operators, etc., for spin behave just like those for “ordinary”, orbital 
angular momentum. Spin is therefore often called intrinsic angular momen- 
tum—it carries all the hallmarks of a quantum angular momentum, 
but denies us the physical picturability we might hope for. Subtle is 
the Lord. 


8.5 Review 


Our study of angular momentum has taken us to new heights (or is it 
depths?) of abstraction. A brief review might help make sense of it all. 

Starting from classical angular momentum, we saw how the quantum 
operators De. Êy We, and Ê? are constructed from the position and linear 
momentum operater These led to the angular momentum commutators 
and uncertainty relations, from which we concluded that the various Èjs 
do not share eigenstates, but each È; does share eigenstates with iz 
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The eigenvalue equations for L? and Lys and the corresponding quan- 
tum numbers l and m, were then introduced. I also introduced the raising 
and lowering operators, which either increase or decrease the value of the 
quantum number m by 1. 

A discussion of angular momentum measurements followed—not to 
delve into the details of measuring devices, but to illustrate schematically 
how such measurements could be carried out, and to attach physical mean- 
ing to quantum angular momentum. Our discussion also set much of what 
we’ve learned about bases, superpositions, probabilities, and the statistical 
interpretation into operational context. 

Finally, I outlined the deep differences between orbital and spin angular 
momentum. Treating angular momentum in terms of the operators ĉ, pz, 
etc. is useful and illustrative, but not general. An alternative approach, 
using group theory and commutation relations, includes half-integer val- 
ues of | and therefore naturally accounts for both orbital and spin angular 
momentum. Spin is an abstract angular momentum which has no position- 
space representation, and which cannot be associated with the actual 
spinning of an object. 

Quantum-mechanical angular momentum is immensely important: it’s 
crucial in atomic, molecular, nuclear, and particle physics. The coupling 
of two or more systems to form states of total angular momentum plays a 
key role in many applications, and provides a prime example of one of the 
more fascinating topics in physics: quantum entanglement and non-locality. 
Although we’ve only scratched the surface of quantum-mechanical angular 
momentum, the fundamentals are now on the table. 

Practical utility aside, however, quantum-mechanical angular momen- 
tum forces us to directly confront the abstract nature of the quantum world. 
Einstein said that the Lord is subtle but not malicious. Yet, some years 
later, perhaps while immersed in a particularly deep and abstract problem, 
he returned to the subject. “I have second thoughts,” he remarked to his 
colleague Valentine Bargmann, “Maybe God is malicious.” 


8.6 Problems 


1. Obtain the angular momentum commutation relations from the 
operators by direct calculation. 

2. Suppose the operators A, B , and C correspond to observables A, B, 
and C,, respectively, and that they satisfy the following commutation 
relations: 


[A,B]=0, [A,C])=0, [B,C] 40. 
What do these commutation relations tell us about the eigenstates of 


the various operators? In particular, is it possible for any state to be 
an eigenstate of more than one of these operators? Can any state be 
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an eigenstate of all three operators? Discuss what all this means for 
various sequential measurements of the observables corresponding to 
A, B, and C? 

. This problem pertains to a spin Z system. 

(i) Construct spin} matrix representations of Sy and S_; then use 
those matrices to construct the matrix operator ie ô. 

(ii) Using your matrix for 4 ĝ_, operate on the following sping states: 


v 
Ra 
| 
= 


tip 


( 
( 
(c) Ix) = 5l i) + y3 l-i) 
( 


iii) eles Be your results ae sense, that is, why they are what 
you expect, for each of the three cases. 

. This problem pertains to a spin 1 system. 

(i) Write the angular momentum eigenstates (in the conventional z 
basis). 

(ii) Construct an operator Î that transforms any state into the z 
basis. 

(iii) Show that your operator does what it’s supposed to do by oper- 
ating on an arbitrary spin 1 state, denoted |W), with Î. Explain why 
your result is what is to be expected, that is, how can you see that 
it’s correct. 

. This problem pertains to a spin Z system. 

(i) Obtain the uncertainty relation for Sy and Sy, if the system state 
is: |) = Jy (1+ 2) -iv2| - 2). 

(ii) How would you expect the uncertainty relation between Sy and 
Sy, for the state of Part i, to be manifested physically? That is, what 
measurement(s) would you make in the lab to check this relation? 
What would you find, experimentally? Be concise but clear. 

(iii) For the state | + z), we have AS,AS, = 0. By considering the 
nature of |+ z) in relation to the observable(s) in question, explain 
why this must be true. A few carefully worded sentences should be 
sufficient. 

. You have available a beam of spin i particles moving in the +y direc- 
tion, in random spin states. Starting with this beam, carefully describe 
an experiment that could be used to determine AS, for the state 
| +n), where n denotes some arbitrary direction in the xz plane. (i.e., 
describe an experiment the results of which could be used to con- 
struct AS, for the state.) Include a step-by-step description of how 
the measurements are to be carried out. 

. The operators J, and J_ are given by Je = J, + iJy. 


(i) Construct spin-1 matrix representations of Jy and J_. Then oper- 
ate on the spin-1 states |1) and | — 1) with J-J}, and with J.J; 
explain why your results make physical sense. 
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(ii) Suppose you were to operate on the state |y) = gall) + zl 


with each operator, J_ Te. and Î j. Think about what you should 
get in each case, and write down your prediction. Then carry out the 
operations. Explain why your results make sense. 
(iii) Are any of the states that you acted on in Parts (i) and (ii) 
eigenstates of either a ver or Î J? How do you know? 

8. For a spin 1 system, the raising and lowering operators (in the z 


basis) are 


(i) Using Se S_, and Eq. (8.10), obtain the matrix representations 
of S, and ce in the z basis, for a spin 1 particle. 

(ii) From the commutation relations and the results of Part (i), find 
the matrix S. in the z basis. 
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The Time-Independent 
Schrodinger Equation 


They do not forgive Epicurus for having supposed, to account for the most impor- 
tant things, an event as small and insignificant as the minute declination of a 
single atom, in order to introduce with much cunning the celestial bodies, the liv- 
ing beings, and destiny, and for the purpose that our free will not be annihilated. 

Plutarch? 


Although our atomic concepts are nothing like those of Epicurus, our 
modern understanding of atoms largely does explain celestial bodies and, to 
a lesser extent, living beings. As for explaining destiny and free will—don’t 
get your hopes up. 

It’s all too easy to take our scientific achievements for granted. Basic 
atomic structure is now routinely taught in high-school. But our under- 
standing of atoms is, in fact, the crowning achievement in a 2,500 year 
quest to understand the constituents of the physical world. Yet almost all 
of that understanding has come about since the 1920s—that is, since the 
advent of quantum mechanics. 

The time-independent Schrödinger equation (TISE) is the quantum tool 
that ushered in that understanding. Because it explains atomic, molecular, 
and even nuclear structure, much modern physics and chemistry centers 
on this equation. Many quantum texts introduce the TISE early on, and 
greatly emphasize its application. I have not. Why? 

Important as it is, the TISE concerns only one area of quantum 
mechanics. Our overarching goal has not been to grasp quantum mechan- 
ics’ applications, but its structure—the conviction being that this is best 
accomplished through a general treatment. With that general treatment 
now in hand, we can understand time-independent systems within the 
broader context of the quantum formalism. 


1 Pullman (1998), p. 39. 
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9.1 An Eigenvalue Equation for Energy 


Although you may have heard that the Schrödinger equation is the key to 
quantum mechanics, I’ve said precious little about it, apart from a brief 
introduction in Chapter 2. The time-dependent Schrödinger equation is 


ih = HW. (9.1) 


Here Y is a quantum state (I could have written it as the ket |W)). Equation 
(9.1) is a general law of quantum mechanics. Thus, no reference is made to 
the type of quantum system that Eq. (9.1) describes. 

As prelude, consider Newton’s second law:? He = md, where Pe 
is the net force on an object, m is its mass, and @ is its acceleration. 
The second law’s simple appearance is deceptive: it can describe an end- 
less variety of physical systems, corresponding to different net forces. And 
depending on the net force, the second law can easily become difficult—even 
impossible—to solve exactly. 

Similarly, Eq. (9.1) may look simple, and not particularly interesting. 
But the Hamiltonian operator, al , can take on an endless variety of forms. 
So, just what is this A? 

In classical physics, the term “analytical mechanics” refers to various 
formulations of mechanics which do not rely on Newton’s second law. 
In one of these, Hamiltonian mechanics, a central role is played by the 
Hamiltonian, H = H(z2,y,2z,pz,Py,pz,t), a function of the coordinates, 
the momenta, and possibly the time.’ In-depth discussions of Hamiltonian 
mechanics are readily available; all we need to know is that, for almost all 
cases, H = K+V. That is, H is simply the sum of the kinetic and potential 
energy functions, K and V. 

To obtain the quantum operator H , we “quantize” the classical H by 
substituting in appropriate quantum operators. Then we can write down 
Eq. (9.1), 


OV (x, yY, Z, t) 


ih = HV(x,y,z,t 9.2 
Ot ( p Y, ? J; ( ) 
and substitute in our particular H. In one sense, Eq. (9.2) is a differential 
equation; in another, it’s a limitless collection of differential equations! (If 
my use of a partial derivative confuses you here, see note 11 of Chapter 5.) 

Let’s first consider the simplest possible case: a free particle in one- 


dimension. The fundamental (and fundamentally important!) classical 


relation F = —VV becomes, in one-dimension, F = ©. For a free 


2 The second law actually is: Fic = dp/dt. But for constant m, this equation reduces 
to the familiar form Fret = mda, which is adequate for our purposes. 

3 Here H is written in terms of Cartesian coordinates, but much of analytical mechan- 
ics’ power involves using alternative coordinate systems—a subtle topic that I won’t 
discuss. 
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particle, F = 0, so oe = 0, that is, V is a constant. With no loss of 


generality, we can choose V = 0.* The classical Hamiltonian is then just 
H = K = p*/2m. To quantize H, we substitute the operator p for p, that 


is, p > p= —iħ 5z o . Thus, Ê = z 2, and Eq. (9.2) becomes 


m Ox 
o ih 0? 
—wv = = V(a,t). i 
Tle t) = yatla, t) (9.3) 
If the force is non-zero, things get more interesting. For a classical 
harmonic pega the force is F = —ka, with k a constant. Then 
2 
-kx = —&, or V = $k”, so we have H = £- + $ka?. For the quantum 
oscillator, “i = = = a + ake and the Schrödinger equation becomes 
o —h? 6 1 
ihR—WV(a,t) = ——~WV(a,t) + kxr’ t(x, t). 9.4 
hE Yle, t) = F— U(0,t) + katla, t) (9.4) 


In principle, an infinite variety of classical forces, and thus of potential 
energy functions V, could exist—each corresponding to a different H. These 
classical forces, and Vs, could also be time-dependent, leading to time- 
dependent Hs. But to obtain the TISE, we specify that His independent 
of t, that is: H 4 H(t). 

From above we see—even for the simplest case of a free particle—that 
when we substitute in an actual H we obtain a partial differential equation. 
But because we now take H 4 H(t), we can apply separation of variables 
to our differential equation.” We assume W(z,t) may be written as the 
product of a function T, which depends only on t, and another function W, 
which depends only on x: Y(x,t) = T(t)y(x). Then Eq. (9.1) becomes 


inya) ZË -= re) Aron) 
fae eae y(x). (9.5) 


Here I’ve used the fact that 4 does not alter y(x), and Ĥ does not 
alter T(t). Because the left-hand side of the (second) equation above now 
depends only on t, and the right-hand side depends only on zx, both sides 


must be equal to the same constant. As such, we can write 


Hy(x) = E(x), (9.6) 
Equation (9.6) is the TISE. 
4 We know from elementary classical physics that changing the zero point of potential 


energy has no effect on the physics. 
5 For details, consult most any discussion of partial differential equations. 
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Now, what’s this E that appears in Eq. (9.6)? The separation of 
variables process only tells us that E is some constant, not its physical 
significance. Clearly, however, Eq. (9.6) is an eigenvalue equation, with 
E the eigenvalue. We know that the classical H corresponds to the total 
energy, so we recognize Eq. (9.6) as an eigenvalue equation for energy. 

We saw that Newton’s second law, Ts = mda, is deceptively sim- 
ple—and so is Eq. (9.6). This is due in part to the fact that Eq. (9.6) is 
restricted to one dimension in Cartesian coordinates. The key point, how- 
ever, is that H can take on an endless variety of forms, each corresponding 
to a different differential equation: Eq. (9.6) is very rich indeed! 

Equation (9.6) arises from the spatially dependent part of Eq. (9.5); 
another equation arises from its time-dependent part. Because, again, both 
sides of Eq. (9.5) must equal the same constant, E, we may write 

dT(t) —i 

ae z ETO. (9.7) 
This first-order differential equation is easily integrated, yielding the 
solution 


T(t) = Aexp(—iEt/h), (9.8) 


where A is an arbitrary constant. T he full solution to the time-dependent 
Schrödinger equation (but still with H # H(t)) is, therefore, 


U(x, t) = T(t)b(x) = Aexp(—iEt/h)v(z). (9.9) 


Pll discuss this expression for Y(x,t) later in this chapter. Moreover, it 
will be central to our discussion of time-dependent quantum systems in 
Chapter 11. 


9.2 Using the Schrödinger Equation 
9.2.1 Conditions on Wavefunctions 


I remarked that Eq. (9.1) is a general law of quantum mechanics. In Eq. 
(9.2), however, a restriction has already been imposed: the assumption 
that we’re working in position representation, that is, Y = W(x, y, z,t). 
This occurred because we started with the classical H = H (x,y, z, t). 

If we’re working in position representation, we’re working with wave- 
functions. Some books concentrate heavily on wavefunctions; I have not. 
The reason, again, is that our goal has been to develop an understanding 
of the general structure of quantum mechanics, and wavefunctions are but 
one particular type of quantum state (albeit an important type), in the 
sense that they are written in one particular representation. 

When partial differential equations appear in classical physics, we 
generally have to impose boundary conditions—that is, the actual phys- 
ical constraints that the system satisfies—to obtain physically acceptable 
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solutions. For example, if we wish to determine the electric potential 
in some region, we solve Laplace’s partial differential equation—but to 
eliminate physically unacceptable solutions, we must impose boundary 
conditions. 

Wavefunctions, also, satisfy physically reasonable conditions. First, we 
insist that wavefunctions, like other quantum states, be normalized: the 
total probability must equal 1. We also insist that a wavefunction, along 
with its first derivative, be continuous.® 

With these constraints on acceptable wavefunctions in hand, let’s briefly 
outline how the TISE is typically used in practice. 


e Determine the classical Hamiltonian, H, for the system of interest. 

e Obtain the Hamiltonian operator, H , corresponding to H. 

Substitute H into the time-independent Schrödinger equation (Eq. 
(9.6)). 

e Solve the resulting differential equation. 

e Impose continuity of the wavefunction and its first derivative. 

e Normalize the wavefunction 


It is the imposition of continuity on the solutions and their first derivatives 
that leads to a discrete, or “quantized”, set of eigenvalues and eigenstates.” 


9.2.2 An Example: the Infinite Potential Well 


In Section 4.4.2 we briefly discussed the infinite potential well, or “particle- 
in-a-box.” This system consists of a potential that’s zero for 0 < x < L, 
and infinite elsewhere. Because solving the Schrodinger equation typi- 
cally becomes very challenging very quickly, I will focus on this system, 
since it exhibits the generic features I want to illustrate while remaining 
mathematically simple. 

The infinite potential well clearly is not a physically realistic system: 
real potentials cannot be infinite, nor can they change infinitely fast. 
Because of these unphysical features, the wavefunction conditions for this 
system differ from those discussed above. 


e Continuity of the wavefunction still applies, but continuity of its first 
derivative does not (see note 6). 


6 Write Eq. (9.6) as: d?¢(x)/dx? = 2m/h?(V — E)w(x). If V, E, and y(x) are finite, 
d?1(x)/dx? must be finite; this holds only if d(x) /dx is continuous. See Bohm (1951), 
p. 232. For infinite V (as in Section 9.2.2), this argument fails, so dw(x)/dx can be 
discontinuous. 

T “Quantize” is one of those slightly dangerous words in physics whose meaning 
depends on context. Here it means that the energy eigenvalues constitute a discrete 
set, rather than a continuous distribution. Earlier, however, “quantize” meant creating 
a quantum-mechanical operator from the corresponding classical quantity (such as the 
Hamiltonian). 
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e The wavefunction vanishes identically where the potential is 
infinite.® 


With those modifications in mind, let’s plunge ahead. 

This system has distinctly different potentials in different regions of 
space. This is often the case for the TISE; the prescription is to solve for 
the wavefunctions in each region separately, and then impose our conditions 
on the resulting wavefunctions. 

In the case at hand, the wavefunctions cannot exist in the regions of 
infinite potential. Thus, we need to solve the TISE in the region 0 < x < L, 
that is, the region of zero potential. Here the TISE is simply that of a free 
particle: 


-h d d? —2mE 
S qe) = Eele) or FV) = Sve). (9-10) 


It’s easy to see that the general solution to Eq. (9.10) is 


y(x) = Acos (V2mE R x) + Bsin (2m R x) (9.11) 


where A and B are two arbitrary constants (which must appear in the 
general solution to a second-order, ordinary differential equation). 

Now impose continuity of the wavefunction. We know that y(x) vanishes 
for x < 0 and for x > L. Thus, the solutions to Eq. (9.10) must vanish 
(1) at « = 0, and (2) at x = L. Condition 1 implies that we must set 
A = 0. Condition 2 implies that the argument of the sine function must 
satisfy 


2mME,/h? L= nr, n=1,2,3,.... (9.12) 


Here E has been replaced with E,, because there is not just one energy 
eigenvalue that satisfies condition 2, but a discrete set of them. Solving 
Eq. (9.12) for the Ens yields 


2 272 
En = Sy n=1,2,3,.... (9.13) 


Corresponding to the E,,s is a set of energy eigenstates, the Yp (x)s 
n(x) = Bsin( V2mEn/M? x) = Bsin(nrx/L). (9.14) 


Note that it was the imposition of the wavefunction conditions above that 
resulted in “quantized” energy eigenvalues and eigenstates. 


8 See, for example, Liboff (2003); Schiff (1968), pp. 32-33; Townsend (1992). Vanishing 
of the wavefunction for infinite V may be justified by examining the wavefunctions of 
the finite potential well in the limit V — oo. 
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We can easily determine B from the normalization condition, 


oo L 
=f jena) Pde = 2? f sin? (nrz/L)dz = BPL/2. (9.15) 


Thus, B = ,/2/L, and our normalized eigenstates are 


Pnl) = Ve sin (nrz/L), n=1,2,3,.... (9.16) 


Though the infinite potential well is far from trivial, it is one of the 
simplest energy eigenvalue problems. Most other such problems are far 
more difficult. Nevertheless, the additional difficulty is largely “just” math- 
ematics, and our primary interest lies not in applied mathematics, but in 
attaining a clear conceptual understanding. That is what I turn to now. 


9.3 Interpretation 
9.3.1 Energy Eigenstates in Position Space 


The first two wavefunctions of Eq. (9.16), along with the corresponding 
probability densities, are plotted in Fig. (9.1). The plots provide a graphic 
representation of these energy eigenstates, and they’re a good vehicle by 
which to better understand the role of the TISE and its solutions. 

Let’s first recall where the states of Eq. (9.16) came from. The TISE 
is an energy eigenvalue equation, and it holds in any representation in 
which we choose to work. Typically, though, we set up and solve the TISE 
in position representation. But this is not the natural representation for 
energy. Because this point is easily misunderstood, let’s set up an analogous 
but simpler situation. 

Suppose that for some system only two basis states are required to form 
a complete, orthonormal set, and that two such bases are the a basis, |a1) 


and |a2), and the 8 basis, |81) and |G2). Suppose, further, that the a states 
(a) (b) 
go g 
0 L 0 0 7 L 


Fig. 9.1 (a) The n = 1 (solid line) and n = 2 (dotted line) wavefunctions 
for the infinite potential well, (b) the corresponding probability densities. 


118 The Time-Independent Schrödinger Equation 


(but not the 8 states) are eigenstates of an operator A, that is, 
Alaj) = a;|0,). (9.17) 


Equation (9.17) is representation-independent: though it seems most nat- 
ural to work in the a basis, Eq. (9.17) must hold in any suitable basis. If 
we determine how A appears in the 8 basis, and how to expand lay) and 
|a@2) in terms of |81) and |82}, we can write out Eq. (9.17) in the 8 basis, 
and it must hold. 

Matrix mechanics clearly illustrates the difference between an eigen- 
value equation when not in its eigenbasis, and when in its eigenbasis. 
Writing out Eq. (9.17) in matrix form for the eigenstate |a,), first in the 8 
basis and then in the a basis (the eigenbasis), we have 


(Ba) BUC )-9(k) es 


(a LGG) ew 


Here we’ve taken |a1) = b1|G1) +b2|G2). Both of the above matrix equations 
are manifestations of Eq. (9.17), but in two different representations 
(bases). 

When Eq. (9.17) is written in its eigenbasis—as in Eq. (9.18b)—the 
forms of both A and |a;) are simple: A has its eigenvalues on the diagonal, 
and zeros elsewhere, while |a;) consists of a 1 and a 0. But when written 
in the @ basis—as in Eq. (9.18a)—A’s diagonal elements are not just the 
eigenvalues, and A has non-zero off-diagonal elements. Moreover, both of 
|a1)’s elements are now non-zero—that is, the eigenstate |a1), when written 
in the |8} basis, becomes a superposition.’ 

Returning to the infinite potential well, the TISE’s solutions, Eq. (9.16), 
are eigenstates of H , but superpositions when written in position repre- 
sentation. From Chapter 2 we know that position eigenstates are Dirac 
deltas, for example, (x — xo) is the position eigenstate for the position 
xo. Therefore the states of Eq. (9.16), though eigenstates of energy, are 
superpositions of (infinitely many) position eigenstates. (As also discussed 
in Chapter 2, the expansion coefficient for each position eigenstate is simply 
the wavefunction’s value at that position.) 


9.3.2 Overall and Relative Phases 


Any complex function (i.e. a function with both real and imaginary parts), 
say f(x), can be written in polar form: f(x) = R(x) exp(iS(«)/h). Here 
R(x) and S(x) are real functions, called the modulus and the phase, respec- 
tively. Although the phase plays a key role in quantum mechanics, that role 


9 This should all sound vaguely familiar; cf. Section 6.4. 
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is not easily explicated. We’ll discuss the phase in more depth in the next 
chapter, but a brief acquaintance now is worthwhile.!° 

A general rule in quantum mechanics (which PI discuss in more detail 
in the next chapter) is that an overall phase doesn’t alter a quantum 
state. This is tantamount to saying that an overall phase cannot alter any 
probabilities. 

But just what do I mean by an overall phase? An overall phase is one 
which is the same for every component of the state. Suppose that in our 
example above, bı = et! /\/2 and bz = e’©?/,/2, with Ci and Cy real, and 
Ci A Co. Then 


iC, iC2 
Jay) = Tz i Tg Pe) AC: (9.19) 


Here |a 1) exhibits a relative phase—one that depends on the basis state. 
Now suppose, instead, that C1 = C2 = C. Then 


et’ 1 
ai) = (161) +18) — = (16) + 162)). 9.20 
lan) = Sg (160 + 182) — (16o) + 1) (9.20) 
Because e’© is an overall phase—the same for each basis state—it was 


dropped on the right-hand side. The arrow is meant to indicate that the 
two expressions are identical quantum states, even though they are not 
mathematically equal. 

What has any of this to do with the TISE? Solutions to the TISE often 
are real (as for the infinite potential well), or have, at most, an overall (not 
relative) phase.'! Such solutions are, effectively, real wavefunctions. Thus, 
learning quantum mechanics primarily through the TISE risks instilling 
misconceptions due to a focus on real wavefunctions. 

Although wavefunction plots (such as Fig. 9.1) can be illustrative, they 
can also obscure the fact that such plots are possible only because the 
wavefunction is not complex. And the complex nature of quantum states, 
including wavefunctions, is critically important (as we’ll discuss in following 
chapters). 

Remember that the starting point in this chapter was the time- 
dependent Schrödinger equation. Thus, solutions to the TISE must be only 
part of the story. As we saw in Eq. (9.9), the full state must include a factor 
of exp(—iEt/h). 

Just what effect will this time-dependent part of the state have? The 
answer is: None! The factor exp(—7Et/h) has no dependence on x. Thus, it 
is simply an overall phase for our position-space wavefunctions, and it can 
be dropped with impunity. It would seem, then, that the time-dependent 


10 Chapter 10 and Appendix A discuss the basics of complex numbers and functions. 
11 In position representation, this means the phase doesn’t depend on the spatial 
variable(s). 
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part of the state is of no interest. And so long as we focus only on energy 
eigenstates, that’s correct—and it’s why many discussions of the TISE 
largely ignore the phase. 

But what if a system is in a superposition of energy eigenstates? That’s 
absolutely legitimate—and it can change everything. In a superposition 
of energy eigenstates, each eigenstate generally has a different energy 
eigenvalue, and thus a different factor exp(—iEt/h), associated with it (con- 
sider the infinite potential well). This fact will play a key role in discussing 
time evolution in Chapter 11. 


9.4 Potential Barriers and Tunneling 
9.4.1 The Step Potential 


Every day, in bars and pubs across the globe, “experiments” in classical 
scattering are conducted: whenever one billiard ball is accelerated towards, 
and then strikes, another, a simple classical scattering event has occurred. 

Billiard balls exert forces by contact. A more interesting type of classi- 
cal scattering arises from forces that extend through space—for example, 
two macroscopic, electrically charged objects. If, say, object 1 is very mas- 
sive compared to object 2, and initially stationary, then object 1 remains 
essentially stationary during the scattering event. In that case, it’s a good 
approximation to represent object 1 by an electric potential, or equivalently 
by an electric field, that’s fixed in space. Often we can do the same thing in 
quantum mechanics, treating scattering as the interaction between a par- 
ticle and a fixed potential. Both conceptually and calculationally, though, 
the quantum case is far more subtle.!? 

The simplest case of quantum scattering is that of a “particle” traveling 
through force-free space towards a potential step, or barrier, of height Vo 
(see Figure 9.2).13 The particle then scatters off of (interacts with) the step. 
This is a standard textbook problem—often found (as here) in discussions 
of the time-independent Schrödinger equation.'4 

But wait! Scattering is clearly a time-dependent process—why discuss it 
in the context of time-independent quantum mechanics? First, because the 
step potential provides a simple application of the TISE. And because, 
as we'll see, step-potential scattering can be analyzed using the TISE, 
although only for a special case, and only after some rather subtle con- 
ceptual leaps have been made. (Moreover, the analysis is far simpler than 
using the time-dependent Schrédinger equation.) 


12 Our goal is to illustrate some features of quantum mechanics, not to discuss 
scattering per se—a vast subject, which most standard quantum texts treat in some 
depth. 

13 Speaking carefully, in quantum mechanics it’s not a particle that’s travelling, but a 
wavefunction—a quantum state—from which we may extract measurement probabilities. 

14 Many “modern physics” texts use the TISE to discuss both step-potentials and 
tunneling. 
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Region 1 Region 2 


x=0 x 
Fig. 9.2 The step potential; for Region 1, x < 0, while for Region 2, x > 0. 
Before discussing the step potential as a scattering problem, however, 


we'll examine it as an energy eigenvalue problem. The Hamiltonian operator 
for the step potential is: 


a2. 2 42 

~ Ë -d ; ; 

= om — Dm da? Be rs 

g p? — h2 2 
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In both regions, solutions to the TISE are of the same form: ẸYı(x) = 
Apetit 4 A_e~*® and Wo(x) = Bye*2* + B_e~**2” (1 and 2 denote 
regions; A,, A_, B}, and B_ are complex constants). Substituting these 
into the TISE easily yields the system energy, E: 


Rk? Rk 
E=- += 24V. (9.22) 
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Thus, kı and kə satisfy: 


ImE JIm(E — 
oe AEN (9.23) 


If E > Vo, then both kı and kg are positive, real numbers, and kı > kə. 
Moreover, from e**** = cos(ax) + isin(ax) (see Section 10.1.2), we see 
that both the real and imaginary parts of Yı (x) and W(x) are oscillatory 
(though with different frequencies for Yı (x) and W2(z)). 

But what if E < Vo? Then kı remains real and positive, but because 
2m(E — Vo) is now the square root of a negative number, k2 becomes 
imaginary: ky = ti,/2m(Vo — E)/h. Thus, the exponential in Yə(x) 
becomes real: Ya(x) = Cpe"? + C_e~"*, where k = \/2m(Vo — E)/h. 
Though fine mathematically, this seems physically absurd: How can the 
total energy, Æ, be less than the potential energy, Vo? Good question: but 
for now, Pll defer answering it. 

Evidently we have two distinct cases: E > Vo and E < Vo. For the 
E > Vo case, the most general energy eigenstate would include all four 
possible constituent states: Yı (x) = Ape? + A_e~*1®, and Wo(r) = 
Bett? 4 Be ae, 
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For the E < Vo case, also, there appear to be four possible states: 
P(x) = Aye + Aet, and Wo(x) = Cpe"? + C_e—**. However, 
because « is a positive real number, V2(a) = C,et** diverges (approaches 
infinity) as 2 — oo, and so is not an acceptable wavefunction. 

We could now finish constructing our energy eigenstate wavefunctions 
for each case by imposing continuity of the wavefunction and its first deriva- 
tive at the boundary between the regions, that is, at x = 0.15 This is a fairly 
straightforward exercise—one Ill leave to the standard textbooks.'® 


9.4.2 The Step Potential and Scattering 


So far, we’ve treated the step potential as an energy eigenvalue problem. 
That is, we’ve applied the TISE to the step potential to obtain the energy 
eigenvalues and eigenstates (in position representation). But how is this 
relevant to the time-dependent process of scattering? 

To correctly model scattering, the initial state must be a wavepacket—a 
state with finite spread in both position and momentum (e.g., a Gaus- 
sian)—which is time evolved until interaction with the potential effectively 
ends. Wavepackets must obey the position-momentum uncertainty rela- 
tion: AvAp, > h/2. This means, roughly, that the price to be paid for a 
narrow momentum probability distribution is a wide position probability 
distribution, and vice versa. 

In the limit of an infinitesimally narrow momentum distribution—a 
momentum eigenstate—the corresponding position distribution is maxi- 
mally spread out. Such a state is physically unrealizable, but states with 
well (but not perfectly) defined momenta are realizable. A momentum 
eigenstate is a good approximation to such a state. 

In Region 1, the step potential’s energy eigenstate is Y (x) (see Section 
9.4.1). Both “pieces” of Yı(x), that is, Aye” and A_e~™*, are spread 
out over all of Region 1, and each has a well-defined momentum (to check, 
let p = —ih d/dz act on each). Thus, we can think of A,e'*!” as represent- 
ing an ensemble of particles, all with momentum p = +hk (and similarly 
for A_e~*™”, but with p = —hk,). For the E > Vo case, similar arguments 
apply in Region 2. Thus, if we solve the time independent Schrödinger 
equation for the step potential’s energy eigenstates, we’ve obtained a sort 
of “steady-state solution” to the time-dependent scattering problem. 

Consider the EF > Vo case in detail. Because we’re modelling the interac- 
tion of an incoming beam of particles with a potential step, we can impose 
an additional condition on our wavefunction. Physically, we have a beam of 
particles incident from the left, but no such beam is incident from the right. 


15 For the step potential, V is discontinuous (as for the infinite potential well), but 
it is not infinite, so continuity of the derivative applies in this case. See note 6. 

16 See, for example, Liboff (2003), Section 7.6; Shankar (1980), Section 5.4; Townsend 
(1992), Section 6.10. 
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(a) E>Vy (b) E< V 
x= 0 x=0 


Fig. 9.3 Real parts of the (complex) scattering wavefunctions for the step 
potential, for (a) E > Vo, and (b) E < Vo. (The imaginary parts are simi- 
lar.) Arrows indicate the momenta in the two regions. Such representations 
are idealized: physically realistic states would be localized wavepackets, not 
infinitely extended, time-independent waves. 


Therefore, although B_e~‘*?* is mathematically acceptable in WV2(z), it’s 
not physically acceptable, and must vanish (i.e., we set B_ = 0). 

Now let’s cheat just a bit by thinking in terms of particles. In Region 1, 
incident particles move in the +x direction with kinetic energy E; transmit- 
ted particles, in Region 2, also move in the +z direction, but with kinetic 
energy E — Vo.!" So far this seems quite classical. However, even though 
E > Vo, some of the incident particles are reflected at x = 0, traveling in 
the —a direction with energy E.'® This effect is clearly non-classical—and 
somewhat surprising. (See Fig. 9.3 (a).) 

For the E < Vo case, things get more interesting. A classical particle 
with E < Vo could never enter Region 2. But the wavefunction does. What 
does this mean for the quantum particle? 

First, even the question is dangerously misleading. In quantum mechan- 
ics, it’s not a particle that enters Region 2, but a wavefunction—not the 
same thing! Because the state has non-zero amplitude in Region 2, there’s 
a non-zero probability of finding the particle there upon measurement. But 
how do we “think about” the particle in Region 2? That’s not clear, and 
it’s why a question mark appears in Fig. 9.3 (b). 

One problem with the conceptual picture we’ve formed is that we only 
suspend our classical judgement when convenient. In Region 2, a particle 
beam moving to the left is eliminated because, classically, no such beam 


17 This reduction in kinetic energy, and thus speed, occurs instantaneously. This phys- 
ically impossible effect arises from the physically unrealistic discontinuous change in 
potential. 

18 Recall that, in Region 1, the wavefunction is comprised of both Aye? and 
A_eatkie, 
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exists. Yet for the E > Vo case, a reflected beam is retained in Region 1, 
even though reflection is prohibited classically. Similarly, for the E < Vo 
case, the wavefunction enters Region 2, even though a classical particle 
can’t. 

Arguably, such conceptual questions result from the fact that we are 
solving the problem as an approximation to, and idealization of, the real 
situation. The proper way to gain a clear understanding of the physics, and 
thus justify our conceptual picture, is by examining the real, wavepacket- 
based scattering problem. Pll return to this topic briefly in Section 12.3.2. 


9.4.3 Tunneling 


Because in Section 9.4.2 we obtained a time-independent state—quite dif- 
ferent from the time-dependent wavepackets of the “real” problem—it may 
seem that we’ve done little of physical interest. But we can still address 
a key physical question: For some incident particle beam, what is T, the 
fraction of particles transmitted, and R, the fraction reflected? 

The process of matching boundary conditions at x = 0 (which I dis- 
cussed, but didn’t actually carry out, in Section 9.4.1) determines the 
relative sizes of the constants A}, A_, B} (for E > Vo) and C_ (for 
E < Vo). The ratios |A_|?/|A+|? and |B,|?/|A+|? are the probability den- 
sities for the reflected and transmitted beams, respectively, relative to the 
incident beam.!? To obtain T and R, however, these ratios must be appro- 
priately weighted by the speeds of the particles in the two beams (I again 
leave details to the comprehensive texts).7° 

Now consider a somewhat different potential: a step potential of finite 
length, say L, returning to V = 0 for x > L (see Fig. 9.4). For x < 0 and 
for x > L, the wavefunction is similar to that for the step potential, that 
is, Ape™* + A_e~**® for x < 0, and Bye” for x > L. Again invoking a 
particle picture, for x < 0 and x > L, particles move with kinetic energy E. 

What happens in the barrier region? For the case E > Vo, a particle’s 
kinetic energy is reduced to E — Vo. As for the step potential, some of the 


V(x) 


x=0 x=L x 


Fig. 9.4 The rectangular potential barrier. 


19 There is no transmitted beam for the E < Vo case. 
20 Rigorously, this is accounted for with the probability current density, or probabil- 


ity flux. 
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particles are reflected at x = 0. In addition, another reflection can occur 
at x = L. Apart from these non-classical reflections, however, the overall 
picture agrees fairly well with our classical expectations. 

When we consider the E < Vo case, however, things go from inter- 
esting to profound. Because the potential barrier is of finite extent, the 
wavefunction in the barrier, though rapidly decreasing, is non-zero at L. 

What happens at L? Again, continuity of the wavefunction and its first 
derivative must hold. This means, in part, that because the wavefunction 
within the barrier is non-zero at L, so must there be a non-zero wavefunc- 
tion for x > L. But this means that there is a non-zero probability that 
particles pass through to the other side of the potential barrier, where they 
are again free particles. 

Using techniques like those applied to the step potential, we can again 
arrive at transmission and reflection probabilities. But our real interest is 
in the conceptual analysis of the situation. 

For both the step potential and the rectangular potential, the physi- 
cal meaning of the wavefunction’s penetration into the forbidden region 
is unclear. For the rectangular potential, however, the end result of that 
penetration is clear—and quite astounding. Some of the particles incident 
from the left are reflected from the barrier. But others pass through the 
barrier, a region where they could not even ezist classically, and reappear 
on the other side! This is quantum-mechanical tunneling, and it has no 
counterpart in classical physics. 

By using the TISE to introduce tunneling, we avoided much of the 
mathematics that arises in a rigorous treatment using wavepackets and 
the time-dependent Schrödinger equation. But, as for the step potential, 
the conceptual aspects are perhaps more clear in the latter context.?! 


9.5 What’s Wrong with This Picture? 


Solving the time-independent Schrödinger equation constitutes a major 
part of quantum physics, and we’ve only scratched the surface. Important 
as the TISE is, though, its study can instill misconceptions, so let’s sum 
up how those misconceptions may take root. 

Newton’s second law, Fox = md, describes a wealth of classical 
physics. An important subset of classical physics comprises systems in static 
equilibrium. For such systems the second law reduces to Pnet =0. 

Although the acceleration now seems to have disappeared from the sec- 
ond law, it really hasn’t—it’s just that now & = 0. Still, if we focused on 
static equilibrium problems we might lose sight of the centrality of accel- 
eration in classical mechanics, since it would play no meaningful role in 
either our concepts or our calculations. 


21 See, for example, Goldberg (1967); Schiff (1968), pp. 106-107. 
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The time-independent Schrödinger equation describes a wealth of quan- 
tum physics. Energy eigenstates and eigenvalues play a crucial role in our 
understanding of the physical world, especially in nuclear, atomic, and 
molecular physics and chemistry. But for any individual energy eigenstate, 
time evolution amounts to nothing more than multiplication by an overall 
phase. In a sense, energy eigenstates do evolve in time, but in such a way 
that there are no physical consequences. 

The classical world doesn’t consist only of systems in static equilibrium. 
And the quantum world doesn’t consist only of energy eigenstates; rather, 
it is full of systems that do change in time, and we can’t understand these 
by studying the TISE alone. 

Precisely how can the TISE cause us problems down the road? First, 
solutions to the TISE (in position space) are often real wavefunctions, 
which can’t fully reflect the richness of quantum mechanics. In general, 
wavefunctions are complex. Unlike real functions, they simply can’t be 
drawn on paper.?? 

Moreover, even when solutions to the TISE are complex, that fact often 
plays little role in introductory treatments. After all, if we’re just going to 
complex square the state and obtain a real probability distribution, isn’t 
the complex nature of quantum states simply an unnecessary mathematical 
artifact? 

Finally, much interesting quantum mechanics arises from superpositions 
of energy eigenstates. And dealing with such superpositions requires us 
to deal explicitly with the complex nature of the wavefunction. Quan- 
tum interference, which gives rise to many uniquely quantum-mechanical 
phenomena, is crucially dependent on the complex nature of the state. 

In the next chapter, we will explore in depth the role of the phase, and 
the complex nature of the state. 


9.6 Problems 


1. Consider the n = 2 energy eigenstate of the infinite potential well. 
Suppose that, rather than adopt the statistical interpretation, we 
interpret this state as describing an actual, physical particle with a 
well-defined energy (which must all be kinetic). Carefully describe 
how the energy and position probability distributions (i) are, and/or 
(ii) are not in agreement with such an interpretation. 

2. Consider the following statement. For each sentence, state whether 
you agree with it, and justify your answer. 

Solutions to the TISE are energy eigenstates. These solu- 
tions are functions of position, x. Therefore, there is a unique 
energy corresponding to each value of x for such a state. 


22 Of course, one can plot, say, the real part of a wavefunction, or the imaginary part, 
or the modulus. But these are not the wavefunction itself. 


3. The potential energy of a classical harmonic oscillator is: V = —Kax 
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where « is a positive, real constant. 

(i) Set up (but don’t solve) the TISE for the corresponding quantum 
harmonic oscillator. 

(ii) Now assume there’s also a damping force of (—a/m)p, where m 
is the mass, a@ is a real constant, and p is the momentum. Repeat 
Part (i) for this case. 

. For a rectangular potential barrier, the methods of Section 9.4 predict 
that T, the fraction of particles transmitted, will be 


-1 


V sinh? («L) 
Pas ie E 
| 4E(Vo — E) | a 
V2 sin2(koL)] ~ 
geo (Ces pee ne i E> V 9.24 
| e) A (9.24) 


where «, kg, and L are defined in Section 9.4. To investigate T’s 
behavior, set m = h = 1, take Vo = 10, and plot T as function of: 

(i) energy, with 0 < E < 50, 

(ii) barrier width, with E = 20 and 0 < L < 2, 

(iii) barrier width, with E = 5 and 0< L <2. 

. Computer-generated “movies” of wavepacket scattering from a poten- 
tial barrier, including tunneling, can be helpful in forming physical 
pictures of such processes. Using Goldberg (1967), Schiff (1968) 
pp. 106-107, or other sources, investigate wavepacket scattering from 
a rectangular potential barrier. 

(i) In discussing both the step and rectangular potential barriers, we 
considered the E > Vo case, and referred to particles with kinetic 
energy E, or E — Vo. Is this language applicable for wavepackets? 
That is, does a wavepacket have a well defined energy? Justify your 
answer. 

(ii) Consider the TISE results for the step and rectangular barriers for 
the E > Vo case. Compare these results with wavepacket scattering. 
In what ways does the TISE solution accurately portray what happens 
with wavepackets? In what ways is it misleading or incomplete? 

(iii) Repeat Part (i) for the E < Vo case. 
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Why Is the State 
Complex? 


Born advanced [a theorem] which [was] destined to play a fundamental role in 
the further development of quantum theory, its interpretation, and its theory of 
measurement... the theorem of the interference of probabilities according to which 
the phases of the expansion coefficients, and not only their absolute values, are 
physically significant.! 


It was 1926—the dawn of quantum mechanics—when Max Born realized 
that quantum-mechanical expansion coefficients, and thus quantum states 
themselves, cannot simply be real quantities: they must, in general, be 
complex. 

But why? In one sense, this question is easily answered. The Hamilto- 
nian operator, H , is real,? so solutions to the time-dependent Schrödinger 
equation, 


(10.1) 


must be complex—as can be seen by assuming the contrary, that is, that 
W is only real or only imaginary. This does explain, formally, why the state 
is complex, but we would like to know more. What essential features of 
quantum mechanics depend on the state’s complex nature? How and when 
do these features arise? 

Such questions are often neglected, yet one can’t fully understand quan- 
tum mechanics until they’ve been addressed. To do so, we'll need a clear 
understanding of certain aspects of complex numbers, and developing that 
understanding will be our first goal. With that in hand, we’ll investigate 
the role of the phase in quantum mechanics. We’ll first see that the full 
range of physical possibilities can only be incorporated into complex states. 
Then we'll develop the difference between overall and relative phases, the 
effects of each on probabilities, and how such phases arise through unitary 
operators. 


1 Jammer (1966), p. 290. 
2 H can involve i, but this is not the usual case. See Schiff (1968), Section 20. 
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10.1 Complex Numbers 
10.1.1 Basics 


Grasping the implications of complex quantum states requires a basic but 
firm understanding of complex numbers. To that end, this section deals not 
with physics per se, but with mathematics. Although discussions of com- 
plex numbers are easily found elsewhere, the approach here is specifically 
designed to lay the groundwork for our later foray into the complex nature 
of quantum states.? 

The starting point for complex numbers is, of course, the imaginary 
number i, where i = \/—1. If Cı is a complex number, then by definition 
Cı has both a real part, denoted Re C1, and an imaginary part, denoted 
Im Cı. Both Re Cı and Im C; are real numbers; let’s agree to call them a 
and 8, respectively. Then we can write C4 as 


Pll call this the “standard form” for a complex number. 

Addition, subtraction, and multiplication of complex numbers are 
defined in the “obvious” way, that is, as they would be for real numbers. 
Suppose another complex number, C2, is given by C2 = y + ið. Then 


Cı + C2 =a+y+ilß +ô) 
Cı — C2 =a — y+ ilb — ô) 
C1 C2 = (a + ib) (y + iô) = ay — bô + i(ad + 8y). (10.3) 


It’s crucially important to recognize that real and imaginary quantities are 
like “apples and oranges”; they are fundamentally different. For example, 
if Cy = Co, then we must have a = y and 8 = 6; the real and imaginary 
parts of the sum must each be equal. Thus, C1 = Cə is effectively two 
equations. 

By definition, the complex conjugate (or just “conjugate” ) of some quan- 
tity D, denoted D*, is obtained by simply changing the sign of i everywhere 
it appears in D. For a complex number such as C4, then, CT = a — if. 

Moreover, |D|? is defined to mean “multiply D by D*, its complex 
conjugate.” So for |C,|?, we have 


ICi? = C10} = (a +: 18)(a — ib) = a? + 8°. (10.4) 


Simple. But there’s a potential problem. When dealing with complex 
quantities, one must be scrupulously careful to distinguish ordinary mul- 
tiplication, such as in Eq. (10.3), from multiplication of a quantity by its 


3 This section is intended for those with at least some familiarity with complex num- 
bers. If you have none, I suggest you consult a more systematic treatment. Also see 
Appendix A. 
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conjugate, such as in Eq. (10.4). Both operations are legitimate, and both 
could appear in a calculation—but they obviously don’t yield the same 
results, for example, C? 4 |C;|?. 


10.1.2 Polar Form 


Any complex number can be written in standard form, that is, in terms of 
its real and imaginary parts (as in Eq. (10.2)). Often, however, it’s more 
convenient to use an alternative form, called polar form. 

An ordinary vector A in the zy plane (Fig. 10.1(a)) can be fully spec- 
ified with just two pieces of information, such as A, and Ay, its x and y 
components. Instead, however, we could specify A’s length and its angle 
with respect to the x axis. Given A, and Ay, we can find the length and 
angle of A, and vice versa. Which of these two means of specifying A is 
more convenient depends on context. 

The complex plane is a plane consisting of a real axis, and an orthogonal 
imaginary axis—corresponding to the real and imaginary parts of a com- 
plex number. We can represent a complex number as a vector in this plane 
(see Fig. 10.1(b)).* Clearly, the complex plane is an abstract mathemati- 
cal creation. Unlike the xy plane, we can’t think of the complex plane as 
representing physical space. What’s important, however, is that the same 
rules apply for vectors (complex numbers) in the complex plane as apply 
to ordinary vectors in the xy plane. Let’s see how. 

Writing the ordinary vector A as A= Aĉ + AQ specifies A’s com- 
ponents. Similarly, writing a complex number in standard form, as in 
Eq. (10.2), corresponds to specifying the number’s components. In Figure 
10.1(b), œa and 8 are the components of C4. 


(b) 


Imaginary 


Fig. 10.1 The analogy between (a) a vector, A, in the xy plane, and (b) a 
complex number, Cj, in the complex plane. 


4 This doesn’t mean that one axis exists in reality and the other only in one’s 
imagination. 
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In Figure 10.1(b), we see that the “length”, properly called the mod- 
ulus, of Cy is \/Re C? +Im C? = \/a? +2. And just as the angle 
of A is tan~'(A,/A,), the “angle”, properly called the phase, of Ci is 
tan~'(Im C1/Re C1). By convention, the length or modulus is denoted R, 
and the angle is denoted @. 

Again referring to Fig. 10.1(b), we could write Cı in component form 
as Cı = R(cos@ + isin@). Using Euler’s formula,’ exp(+ix) = cos(x) + 
isin(x), then, we have, Cı = Rexp(i@). A complex number written in this 
manner—as the product of a real modulus, R, and a complex phase factor 
(with a real phase, 0)—is said to be in polar form. 

Polar form can be very convenient if a complex number’s modulus or 
phase are of interest, since they can then simply be “read off”. But beware! 
It’s easy to wrongly think that the modulus R and the phase factor et? are 
the real and imaginary parts, respectively, of a complex number. After all, 
that’s what they look like—until you write out the exponential and obtain 
Rexp(i#) = R(cos 0 + isin 0). 

By contrast, the modulus and phase of a complex number written in 
standard form can’t simply be read off, but its real and imaginary parts 
can. Again, which form is preferable depends on context. 


10.1.3 Argand Diagrams and the Role of the Phase 


Although complex numbers aren’t really vectors, we can represent them 
as such, as in Fig. 10.1(b). Such representations, called Argand diagrams, 
provide a useful means of visualization. 

The analogy between ordinary vectors and complex numbers extends 
to addition. Consider the sum of A and B , two vectors in the zy plane: 


A+ B= (Az#+ Ay) + (Bet + Bô) 
= (A, + By)é + (Ay + By)p. (10.5) 


Just as we added A and B by x and y components, so do we add complex 
numbers by real and imaginary components. For C + C2 we have 


Cy + C2 = (Re Ci + ilm Cı) + (Re Co + ilm C2) 
= (Re Cı + Re C2) + i(Im Cı + Im C2). (10.6) 


(Compare with Eq. (10.3).) This implies that, just as we can graphically add 
ordinary vectors in the xy plane, we can graphically add complex numbers 
in the complex plane. By putting this principle to work in an example, we 
can begin to see the crucial role of the phase. 

First, let’s again consider some complex number C, written in polar 
form as C = Rexp(i0). If we represent C as a vector in the complex plane 


5 This is an important identity. Be sure you are familiar with it. 
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Fig. 10.2 Addition of complex numbers. (a) The moduli of F, G, and F+G 
are unaltered by an overall phase change. (b) A relative phase change alters 
the modulus of F + G. Dashed lines represent quantities after phase shift. 


using an Argand diagram, the modulus R is the vector’s length, while 
the phase @ is the vector’s angle. What if we were to multiply C by, say, 
exp(iz/4), thus altering its phase? That is, 


C = Rexp(i@) — Rexp(im/4) exp(i0) = Rexpfi(7/4+0)]. (10.7) 


Clearly, this changes only the angle of the vector, not its modulus. 

What if we were to add two complex numbers? Let’s call the two com- 
plex numbers F and G, and let’s assume that they have equal moduli, R, 
but unequal phases, 0r and 6g. Figure 10.2(a) shows F + G graphically in 
the complex plane. If we now “shift” the phases of both F and G by the 
same amount ¢, we have, 


Re®r + Rec —, ReiOrt9) 4 ReiOctd), (10.8) 


This overall phase shift (i.e. the same for both F and G) changes the 
orientation of both vectors equally, so that their relative orientation is 
unaltered. As a result, the modulus of the sum, F + G, is also unaltered by 
this overall phase change (Fig. 10.2(a)). 

So, what’s the effect of a relative phase change—when the phase change 
for F differs from that for G? To answer this, it suffices to change the phase 
of just one of our complex numbers, say G: 


Re F + Ree — Retr + Reil@ct9), (10.9) 


Clearly, G now rotates in the complex plane, while F does not—and this 
changes everything. Now the modulus of F + G does change, even though 
the moduli of F and G individually are unaltered (Fig. 10.2(6)). It’s the 
phase that effected this for us: in fact, because we’ve taken the moduli of 
F and G to be equal, there’s some ¢ for which F + G vanishes identically! 
This illustrates the crucial role of relative phase changes. 
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Some remarks about terminology are in order. In physics, words some- 
times have completely different meanings depending on context; phase is 
one such word. In thermodynamics, phase refers to a state of matter, such as 
solid or liquid. In classical Hamiltonian mechanics, we work in an abstract 
mathematical space called phase space. In astronomy we refer to the phases 
of the moon. And the phase of a classical wave, such as a sound wave or 
a wave on a string, is the wave’s relative spatial position. For example, 
sin(x — 7) is identical to sin(x), apart from a shift in the +a direction by 
m due to the phase, —7. 

These meanings of “phase” seem wholly unrelated to the phase of a com- 
plex number, but that’s not quite true. If we add (superpose) two classical 
waves, their phases partly determine the resulting wave. For example, if we 
add two similar sine waves, the sum depends on their phases. If the waves’ 
peaks “line up,” as for sin(x) + sin(x), the sum is another sine wave, but 
with doubled amplitude. But if one wave’s peaks line up with the other’s 
troughs, as for sin(x) + sin(x — 7), the waves are exactly out of phase, and 
their sum vanishes identically. We’ve now seen that the phases of complex 
numbers determine how they add, so there’s at least a rough similarity 
between the phase of a complex number and that of a classical wave.® 

We’ve discussed complex numbers in some depth now. But quantum 
mechanics uses more than just complex numbers; as we emphasized in 
Chapter 9, it uses complex functions, which we’ve ignored so far in this 
chapter. However, almost all of our discussion about complex numbers 
translates to complex functions—a point that we’ll explore later. 


10.2 The Phase in Quantum Mechanics 
10.2.1 Phases and the Description of States 


In what follows, Pll illustrate the role of complex numbers in quan- 
tum mechanics. The underlying principle will be that, because quantum 
mechanics makes probabilistic predictions, only those changes to a quantum 
state that alter probabilities have physical significance—if no probabilities 
are altered, we really have the same state. 

Let’s consider the simplest case possible: a system for which only two 
states are required to form a complete, orthonormal set. A spin $ system 
is of just this type. As we saw in Chapter 8, a spin 4 basis set is formed by 
the states |1/2,1/2) and |1/2,—1/2); that is, l = 1/2 and m = 1/2, —1/2. 
If it’s understood that we’re working with a spin $ system (in the z basis), 
it’s common to use a simpler notation, in which we replace |1/2,1/2) and 
|1/2, 1/2) with | + z) and | — z), respectively. Because | + z) and | — z) 
form a basis, an arbitrary state of our spin 4 system can be written as 
c+| +2) +c_| — z), where c} and c_ are complex numbers. 


6 Beware that both this “wave phase” and the phase of a complex number can appear 
in quantum mechanics—and both are often simply (and ambiguously) called “the phase.” 
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The z component of the spin angular momentum, denoted S,, is an 
1 


observable quantity. For a spin 3 system, the corresponding Hermitian 


operator, S., satisfies the eigenvalue equation 


Sy ae 2y= 2 | a2), (10.10) 


so the possible measured values for S, are +2 and a Similarly, for the 
x and y components of spin angular momentum we have 


|y). (10.11) 


To even write down a quantum state we must, in general, allow it to be 
complex (i.e. to have complex expansion coefficients). Otherwise, we cannot 
account for all experimental possibilities. As a simple example, consider two 
spin 4 states, |A) and |B), written in the z basis: 


y= S)+2+4]-2), e (01g) 


S,|£2)=+3/42),  Sy/ty)= 


tl ot 


Clearly, if we measure S, with the system in either |A) or |B), we have 


Prob(S =!) =>,  «Preb(S.==").= 5. (10.13) 


That is, |A) and |B) are indistinguishable with respect to S, measurements. 
Yet |A) and |B) are definitely not the same state. It turns out that 
transforming |A) and |B) into the x basis yields 


|A)=|+2), |B) =|-2). (10.14) 
So if we measure S,, rather than S,, on |A} and on |B), we obtain 

Prob (Sz = Bl A) =) 1, Prob (Sz = |A) =0 

Prob (Sa = Ë|B) =0, Prob (S, = |B) =1. (10.15) 

The probabilities for Sẹ measurements for |A) are opposite those for |B)! 
The complex nature of the state is what’s done this for us. 

But wait! you protest. The coefficients of |A) and |B) in the z basis were 

real—there was nothing complex, or even imaginary, in sight. Right; and 

wrong. The two states differ only by a sign change in the coefficient of |— 2). 


But —1 can be written as a complex number: —1 = et" = cos (7) + isin (7), 
so that 


in 


|B) 


LE 
T 


V2 


= ylt 2) z). (10.16) 


Thus, we may regard our sign change as a phase change in the coefficient 
of | — 2). 
Perhaps this seems like a ruse. But consider another state: 


IC) = zl t2 + +|—2z). (10.17) 
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The state |C} is, in fact, | + y} in disguise—and distinct from either | + x) 
or | — x).” The probabilities for S, measurements in |C} are the same as in 
|A) or |B) (see Eq. (10.13)), but the obviously unreal coefficient of | — z) 
in Eq. (10.17) leads to different probabilities for other measurements.® 

In fact, infinitely many distinct spin Z states exist that obey Eq. (10.13). 
All such states must have |+ z) and |— z) coefficients of modulus Fas So as 
to satisfy Eq. (10.13), yet all must be different quantum states. How can we 
possibly satisfy these two requirements? The answer is: by multiplying the 
modulus of one of the coefficients by a complex phase factor e’%, where g is 
a different real number for each state (which is just what differentiates | A), 
|B), and |C)). Evidently what distinguishes these states from each other is 
not the coefficients’ moduli, but their phases. 

Often in quantum mechanics we may deal with states, such as |A) and 
B), whose coefficients are real. But real coefficients can always be written 
as complex numbers. A real number N can always be written as Net, and 
—N as Ne’. 

In fact, for consistency, and to impart a deeper understanding of 
quantum states, one could think of all quantum-mechanical coefficients 
as complex numbers. In some cases, however, the imaginary part of a 
coefficient will be zero, so it may be written as a real number. 

The situation is similar for wavefunctions. There are infinitely many 
wavefunctions with identical moduli—and thus identical position proba- 
bility distributions—but different phases. These are thus different states, 
with different momentum probability distributions (as is discussed in 
Chapter 12). 

Evidently the phase plays a crucial role in quantum mechanics. And 
phases can not only exist in the description of a quantum state, they can 
be introduced—as we’ll soon see. 


10.2.2 Phase Changes and Probabilities 


We now consider three types of changes to the coefficients of our arbitrary 
spin 4 state, c,|+z)+c_|—z), and ask whether they can alter probabilities. 
An overall phase change of, say, e’*, with a constant, leads to 


e+|+z)+¢_| - 2) ec4| +z) + e'%e_| 2). (10.18) 


Note that: [cy]? = |e**c;|?, and |c_|? = |e’*c_|?. That is, the overall 
phase change leaves the (complex) squares of the coefficients, and thus the 
measurement probabilities for S,, unaltered. 


7 Writing i = et7/2 
case, |C}. 


8 For example, Sẹ measurements in |C} satisfy: Prob(Sz = £) = Prob( Sz = =) = 5. 


, we again see that a phase change created a new state; in this 
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A relative phase change may be effected by changing just one of the 
phases 


e4| +2) +e_|— 2) ec4|+2)+c_|—2). (10.19) 


We might expect that the probabilities for S, would be altered by this rel- 
ative phase change. Clearly, though, the complex squares of the coefficients 
of |+z) and |—z), and thus also the S, probabilities, are unaltered. As we’ll 
soon see, this does not contradict the dictum that “overall phase changes 
don’t change states, but relative phase changes do.” 

Our final case might at first seem a bit odd. First write one of the 
coefficients, say c4, as the sum of two other complex numbers: c} = ni +n2. 
Calculating |c;|? in terms of nı and nz, we have 


Je4|? = [ra + nl? = |n|? + [na| + ning + noni 
= |ni|? + na|? + 2Re(nin3). (10.20) 
Now suppose that nı somehow “picks up” a phase of et? (with o con- 


stant). Clearly, this doesn’t alter the modulus, or “length,” of nı (see 
Fig. 10.1). However, the complex square of the coefficient now becomes 


io 


lln + na|? = ni|? + Ina? + ninže + nonje™ 


= |m |? + [n2]? + 2[Re(nın}) cos(a) — Im(nin§) sin(o) 
(10.21) 


Evidently |n +n2|? 4 |en +n2|?, even though nı and efn; have identical 
moduli. And indeed, given Fig. 10.2, that’s what we would expect. 

This process—writing an expansion coefficient as a sum of com- 
plex numbers and then modifying one of those numbers by a phase 
factor—apparently can alter probabilities. But does this correspond to any- 
thing that actually occurs in quantum mechanics? The answer, we shall see, 
is Yes. 


10.2.3 Unitary Operators Revisited 


Unitary operators alter the phases of their eigenstates, and this can alter 
probabilities. But the process is subtle, and likely eludes many students 
(and more than a few instructors!). 

Discussions of unitary operators are easily found, and they may men- 
tion that an overall phase change does not alter a quantum state, while a 
relative phase change does. Still, explanations of just how and when uni- 
tary operators and relative phases can alter probabilities are not standard 
textbook fare. 
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Chapter 5 introduced unitary operators and outlined their basic prop- 
erties. We also saw that unitary operators transform states, and operators, 
but we didn’t discuss their mathematical form, or how, in practice, they 
act on states. 

An arbitrary unitary operator, U, is defined by the property U-1 = 
Ut. This property implies (see Appendix D) that the form of U is: U = 
exp(iG7), where y is a real parameter, and G is a Hermitian operator called 
the generator of the transformation effected by U. 

Take the eigenstates of the generator G to be the |¢x)s: 


Gon) = gel@r)- (10.22) 


This gives G’s action on the |¢;)s, but what’s U’s action? In U, the operator 
G appears in an exponential. To impart meaning to such an object, the 
exponential is defined as its Taylor expansion. Thus, U’s action on the 


lox) is 


a hes A iG 2 iG 3 
be) = enh) =| biG + CE", CCM) ro Jeo 


A 2 ; 3 
{i tony + EADE 4 ST +... Y ha) = 14). (10:23) 


Here I used the fact that |¢,) is an eigenstate of G. Then everything was 
“collapsed” back down into an exponential. The end result is that an eigen- 
state of G is also an eigenstate of U , so when U acts on an eigenstate of Â, 
we can simply replace G (in U) with the corresponding eigenvalue. This is 
a general result: it applies to any unitary operator, its generator, and their 
eigenstates. 


10.2.4 Unitary Operators, Phases, and Probabilities 


This section is a rather abstract discussion of the effects of unitary operators 
on probabilities. If you’re content with a concrete example, you can skip to 
the next section. 

If U acts on a state |W) that is not an eigenstate of G, we may 
examine its action by expanding |W) in the eigenstates of G. Suppose 


|v) = 2 ciloj); then 


j=l 
> m 
ÙY) = |W) = X t cloj). (10.24) 
j=1 
For simplicity, Pve set y = 1. Evidently the effect of any unitary operator 
is, at most, the introduction of relative phase factors with respect to the 
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operator’s eigenstates—in Eq. (10.24), the etis are introduced with respect 
to the |dx)s. 

Even a relative phase change, however, leaves the (complex) square 
of the coefficient of each |¢;) unchanged: |e’%c,|? = |c;|?. And that, in 
turn, implies (as in Section 10.2.2) that a relative phase change leaves the 
probabilities unaltered—at least, that is, for observables whose eigenstates 
are the |dx)s. 

So how can a relative phase change alter probabilities? There’s another 
possibility: introduce a relative phase change with respect to one set of 
eigenstates, and then examine the probabilities for an observable with 
different eigenstates. 

Suppose another orthonormal basis is comprised of the |a,,)s, which are 
not eigenstates of G, but are eigenstates of an operator A, corresponding 
to an observable quantity A. Suppose, further, that A satisfies 


Alan) = anlan). (10.25) 


Then the a,s, the eigenvalues of A, are the possible measurement 
values of A. 
Take the expansion of the |¢,)s in the a basis to be 


Ios) = J ailax). (10.26) 


The al s are expansion coefficients. The index j is fixed: it simply denotes 
the particular |¢) (in this case, |¢;)), that we’re expanding. 

To calculate the probability corresponding to, say, ag in the state |W), 
first expand |W) in the a basis: 


m 


|) = So glg) = i [S> ailan} = X callan). (10.27) 


j=l n=l SL 


Clearly, the index j now is not fixed, reflecting the fact that |W) can include 
contributions from all of the |¢,)s. From Eq. (10.27), the probability 
corresponding to ag is? 


Prob(ag|v) = [S ea} = leah +--+ omar’. (10.28) 
j=l 


Now let U act on |W), denoting the new state |W). This produces a 
relative phase change (see Eq. (10.24)). Then, as in Eq. (10.27), transform 


9 If we start out in Eq. (10.27) not with |W), but with et? |Y), where et? is an overall 
phase, we find that the probability corresponding to ag is unchanged. 
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to the a basis: 


Yv) = U|¥) = dete lġ;j) = = Lema > al |an) | 


m 


=. eI cjal jan). (10.29) 


The probability corresponding to ag is now 
Prob(ag|Vu) = | yS e'i cja} = |e’ ciah Heet ecmag Fe (10.30) 
j=1 


The difference between Eqs. (10.30) and (10.28) is analogous to the differ- 
ence between |en; + ng’, in Eq. (10.21), and |ny + nal’, in Eq. (10.20). 
Evidently, then, the probability in Eq. (10.30) has been altered from that 
in Eq. (10.28). 

Note the process that’s unfolded. A relative phase change in the ¢ basis 
didn’t alter probabilities with respect to that basis. However, upon trans- 
forming to the a basis each |ag)’s coefficient was a sum of complex numbers, 
and the relative phase change in the ¢ basis resulted in an alteration in the 
phases of these numbers. As we know from Section 10.2.2, this does alter 
probabilities. !° 

We may summarize our results as follows: 


Statement 1: The introduction of a relative phase with respect 
to the eigenstates of an operator A can only alter probabilities 
corresponding to the eigenstates of an operator that does not 
commute with A. 


Because a unitary operator can, at most, introduce relative phase factors 
with respect to the operator’s eigenstates, we may recast our statement 
thus: 


Statement 2: The maximal effect of any unitary operator U is the 
introduction of relative phases with respect to U’s eigenstates. 
This, in turn, can only alter probabilities corresponding to the 
eigenstates of an operator that does not commute with U. 


10.2.5 Example: A Spin + System 
Our discussion of unitary operators, relative phases, and probabilities may 
have seemed a bit abstract. We can make things more concrete with a simple 


10 Recall, also, that if two complex numbers are added, the modulus of the sum is 
altered if a relative phase is introduced between them. See Section 10.1.3 and Fig. 10.2. 
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example that illustrates how a unitary operator introduces a relative phase 
change, and how that, in turn, alters probabilities. 

In our example we'll again use a spin i system. Although it’s conven- 
tional to take |+ z) and | — z) as basis states, many other (really, infinitely 
many) suitable bases exist. One such basis comprises the states | + x} and 
| — x). It must, of course, be possible to write | + z) or | — z} in the z basis, 
and vice versa. It turns out that!! 


|+2) = 45(I+2)+1-2)), 
|+2) = (1+2) +|-2)). (10.31) 


Consider a system in the state |+ £) = a (| +z)+|—z)), and a unitary 
operator Ô, = exp(iSz A). The probabilities for measurements of S, in |+2) 
are Prob(+h/2) =1 and Prob(—h/2) = 0. 

Now act on |+ x) with U,, and then transform back to the x basis: 


cider! +2) = (e 1RA/2) +z) + ei) = 2)) 
= SS" (lta) +|- a) + (\42)-|-2)) 
_ elves? | a eu ene | D 


= cos(hA/2)| + x) + isin(hA/2)| — x). (10.32) 


Clearly, the probabilities for Sẹ, have changed. They are now: 
Prob(+h/2) = cos?(hA/2) and Prob(—h/2) = sin? (ñA/2). 

The operator Û, introduced relative phases into the z basis, which, 
upon transforming back to the x basis, became “mixed.” That is, | + x) 
and | — x) each acquired coefficients consisting of a sum (or difference) of 
the relative phases, e’”/? and e~*/2, that were introduced in the z basis. 

This example illustrates the minimum condition for a unitary operator 
to alter probabilities. If A represents the observable of interest, then A 
cannot share eigenstates with U; the system state may be an eigenstate of 
A, but not of U. In the example, Sa, which represents the observable of 
interest, does not share eigenstates with U,. The system state, | +), is an 
eigenstate of as but not of Us. 

As pointed out in Section 5.4, unitary operators are of secondary 
importance for us—with one exception. The time evolution operator plays 
a central role in quantum mechanics. As we’ll see in Chapter 11, Eq. 
(10.32), with minor modification, is an example of quantum-mechanical 
time evolution. 


11 For example, these relations can be obtained from Townsend (1992), Problem 1.3. 
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10.3 Wavefunctions 


I’ve ignored wavefunctions so far in this chapter, because they would intro- 
duce yet more subtlety into our discussion of the phase. Ill deal with 
wavefunctions in more depth later, and the phase will afford considerable 
insight. But since we’ve now gained some familiarity with the phase, let’s 
at least see what it looks like in a wavefunction, and how our understanding 
of the phase translates into the language of wavefunctions. 

If we restrict consideration to the x axis, then a wavefunction in posi- 
tion representation is a function of x and t, which we’ll call w(a,t). The 
wavefunction w(x, t) could be transformed into momentum representation. 
It would then be a function of p and t; call it ọ(p, t). Of course, y(x, t) and 
o(p, t) are the same quantum state—the same wavefunction—written in 
different representations, or bases.!? 

The position eigenstates are the Dirac deltas, d(a — xo), while the 
momentum eigenstates are also Dirac deltas, but now in momentum space: 
ô(p — po). Thus, each position eigenstate corresponds to a single, unique 
position (x value), and each momentum eigenstate corresponds to a single, 
unique momentum (p value).'® 

Now, how might we think of a function of x, say f(x)? If f(x) is a real 
function, then we can think of f(x) as a collection of real numbers—one 
for each x value. Similarly, if f(x) is a complex function, as wavefunctions 
generally are, then f(x) is a collection of complex numbers—one for each 
x value. 

If w(a,t) is a complex function then, like a complex number, it can 
be written in polar form.!4 That is, ~(x,t) may be written in the form 
R(a,t)exp(tS(a,t)), where the modulus, R(x,t), and the phase, S(z,t), 
are real functions of x and t. Once in polar form, we can “read off” the 
phase, S(x,t), just as we did for complex numbers in Section 10.1.2. 

The continuously distributed complex numbers that constitute a wave- 
function are in fact the expansion coefficients. And, just as in the discrete 
eigenstate cases we considered earlier, multiplication of each such coefficient 
by the same phase constitutes an overall phase change. 

Now multiply ~(2,t) by the overall phase e’4, where A is a real 
constant, 


(a, t) = R(x, the) — R(x, thee @, (10.33) 


How does this affect the position probability distribution? The orig- 
inal distribution was |w(a,t)/? = |R(a,t)e%@|? = R(a,t)?. The 


12 Although (a, t) and ¢(p,t) are the same wavefunction—in the sense of the same 
quantum state—they will not, in general, be of the same functional form. 

13 As pointed out in Section 2.3.2, Dirac deltas are not proper functions. 

14 After all, p(x, t) is just a collection of complex numbers. 
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phase transformation of Eq. (10.33) left this distribution unchanged: 
|R(a, the4e#@ |? = |R(a,t)eS@4)|?. But this, of course, is what we 
expect for an overall phase such as exp(7A). 

Similarly, for both the discrete eigenstate case and for wavefunctions, a 
relative phase change is one that multiplies different coefficients by different 
phase factors. In the state w(a,t), each coefficient is associated with a 
particular x value. Thus, for the position-space wavefunction w(2,t), a 
relative phase is a function of x. For example, multiplying y(x, t) by eae), 


plx, t) = R(x, the’) — R(x, the’* eS), (10.34) 


changes u(x,t) by a relative phase, since X(x) is, in general, different for 
each different x value. 

How does this relative phase change affect the position probabil- 
ity distribution? The original distribution was, again, R(æ,t)?. And 
again, the phase transformation leaves the probability unchanged: 
|R(a, the'* et? = R(a,t)?. As we now know, this is precisely what 
a relative phase change should do. 

It should be clear that exactly analogous arguments apply for the 
momentum-space wavefunction, ¢(p, t). In that case, of course, the relative 
phase factor would be a function not of x, but of p, such as eP), 

We’ve seen that a relative phase change can alter probabilities only in 
a basis other than the one in which the phase change was introduced. The 
same holds for wavefunctions. The phase change of Eq. (10.34) would lead 
to an altered momentum probability distribution. Similarly, if a relative 
phase change was imposed on ¢(p,t), the position probability distribution 
would be altered. We will revisit these ideas in Chapter 12. 


10.4 Reflections 


Of what practical value is all this discussion about the complex nature 
of quantum states, of overall and relative phases, etc.? In one sense, it’s 
not of much practical value. Even without understanding such things, if 
we know how to use the mathematics, we can carry out calculations and 
obtain predictions. If, for example, we know how to operate with a unitary 
operator, transform between bases, and extract probabilities, we don’t need 
to understand how relative phases lead to altered probabilities. 

In another sense, however, understanding the complex nature of quan- 
tum states is crucial. Physics, after all, is not a rote, turn-the-crank process. 
It’s a creative process: we create the physical world—concretely, in the lab- 
oratory—and abstractly, in our minds and on paper. To understand the 
world, to create the world, we must marshal all of our faculties. Without 
understanding the complex nature of the quantum state, we cannot fully 
understand how quantum mechanics works—so we cannot bring our full 
creative power to bear. 
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10.5 Problems 


1. Consider two complex numbers: 2 + 2V/3i and 3 — 3i. 
(i) Determine the phase of each number. 
(ii) Determine the relative phase, that is, the phase difference, 
that will: 
(a) maximize the modulus of the sum, 
(b) minimize the modulus of the sum. 
(iii) Sketch the two numbers and their sum on an Argand diagram. 
2. Assume that A and B are complex numbers, which may be written 
in standard form as: A = a + iĝ and B = y + iĝ. By manipulating A 
and B in standard form, show explicitly that the following relations 
hold. 
(i) A + A* = 2i ReA 
(ii) A — A* = 2i ImA 
(iii) AB is a complex number. 
(iv) Re(AB*) = Re(A*B) 
(v) (AB)* = A* B* 
(vi) |ABP = |APIB} 
(vii) AB* + A*B = 2(ReA ReB + ImA ImB) 
3. Now repeat the preceding problem, but working with A and B in 
polar form, that is: A = Rye’’4 and B = Rge’®®. 
4. Write the following as complex numbers in polar form. 


raw Argand diagrams that represent the following. 
(i) the function f(a) = xexp(ix?) at x =2 
(ii) the function g(x) = x {cos(x) + isin(x)} 
(iii) the function h(x) = sin(x) + ix? at x = 1 
(iv) f(x) in the range 1 < x < 2 
(v) h(a) in the range 0 < z< 3 
6. By considering phases, argue/show whether or not the two states in 
each of the pairs below are the same state. 


1 1 
-1 v2 
1 hae gel 
aala) aC) 
i + C22), cet? |1) + c212) 


iv) cill) +e), el (cil) + e212)) 

v) c1|1) — ic2|2), icy|1) + c2|2) 
(vi) cye*?+|1) + cge’?2|2), c11) + cget(2—41) |2 

7. Explicitly obtain the relations for |ny +n2|? and for Je’? n, +n2|? that 
appear in Section 10.2.1. 
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8. Consider a state |), and suppose |Y} could be expanded in the |r;} 
basis states, where j = 1,2,...,m. 
(i) Show that if |w) is subjected to a phase change of e'7, that is, 
|) — e’ |), where y is a constant, then there is an overall phase 
change in the 7 basis. Show explicitly that this phase change cannot 
alter the probabilities associated with the |r)s. 
(ii) Suppose the |w,)s (with k = 1,2,...,m) form another suitable 
basis. Show that |w) — e’7|wW) cannot alter probabilities in this basis, 
either. 
(iii) Use the foregoing to argue/show that if a quantum state is subject 
to an overall phase change in any basis, then it is subject to the same 
overall phase change in every basis. Thus, an overall phase change in 
any basis implies that all probabilities are unchanged. This is why we 
say that an overall phase change does not change a quantum state. 
(iv) As just pointed out, we say that if a quantum state is “changed” 
such that no probabilities are altered, we have the same state. This 
argument may seem unsupportable, but similar arguments—based on 
whether the physics changes—are made in classical physics. 
Investigate gauge transformations in classical electromagnetism (most 
electricity and magnetism texts discuss such transformations). Then 
briefly discuss how such transformations are analogous to an overall 
phase change in quantum mechanics. 

9. Consider two quantum states |) and |Y’), where 


|) = cildi) + c2|¢2), |") = eel) + e"c2|b2). (10.35) 


The |¢;)s satisfy: 


Âli) = a;l). (10.36) 


(i) Show that the probability of obtaining a, is the same for both 
states (for all n). 
(ii) Another set of states, the |)s, satisfies: 


Blxx) = belxe), where: [A,B] £ 0. (10.37) 
Expanding |¢1) and |@2) in the |x) basis yields: 


|o1) = a1|x1) + a2|x2), lġ2) = Gilx1) + Balxe). (10.38) 


Calculate the probability of obtaining bı in a measurement of B for 
both states, |p} and |y’). Then calculate and simplify the difference 
between the two probabilities. 

(iii) Summarize (in words) the results of Parts (i) and (ii). 


11 
Time Evolution 


To-morrow, and to-morrow, and to-morrow, 
Creeps in this petty pace from day to day, 
To the last syllable of recorded time... 
Macbeth! 


Time—within which we dream our dreams, live through joy and sorrow, 
and confront the incomprehensibility of our existence—is that by which we 
know the poignant beauty of life, even as it slips from our grasp. 

The time of physics is not invested with the poignancy of “personal 
time,” the time within which we live our lives. Yet even in physics, time 
plays a special role. Time is the background against which physical pro- 
cesses take place, and the irreversibility of such processes—embodied in the 
second law of thermodynamics—is fundamentally rooted in the behavior 
of time, not space. 

In much of quantum mechanics, time is irrelevant: the goal is to calculate 
energy levels by solving the time-independent Schrodinger equation. Nev- 
ertheless, time-dependent quantum systems are fundamentally important: 
the quantum world is not stagnant! This chapter focuses on time-dependent 
quantum systems, but superpositions, change of basis, Hermitian and 
unitary operators, complex phase factors, commutators, and uncertainty 
relations all enter into the discussion. Thus, while time evolution is of great 
importance in its own right, it will also serve as a means to assemble much 
of what we’ve learned so far into a coherent whole. 


11.1 The Time-Dependent Schrodinger Equation 


Chapter 9 focused on the time-independent Schrödinger equation. But, as 
pointed out there, the real starting point is the time-dependent Schrödinger 
equation: 


qv). 
ihz = HI). (11.1) 


1 W. Shakespeare, Macbeth, Act 5, scene 5. 
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Equation (11.1) is a general law of quantum mechanics. In particular, no 
restrictions are placed on the quantum state, |). It could be a wavefunc- 
tion, or a spin i state, or any other type of quantum state. Furthermore, 
no representation is specified for |W): it’s an abstract quantum state, and 
Eq. (11.1) applies to whatever representation we choose to work in. 

Equation (11.1) is, in general, a partial differential equation (since, 
in practice, H typically involves partial derivatives). Partial differential 
equations are a subject unto themselves and, depending on the form of 
H and the initial state, solving Eq. (11.1) can range from relatively easy 
to impossibly difficult. Our goal, however, is to attain an understanding 
of quantum mechanics, not proficiency in applied mathematics. There- 
fore, we will focus on the most common case, that of time-independent 
Hamiltonians, that is, H 4 H(t). 

As discussed in Chapter 9, if H Æ H(t), then Eq. (11.1) is separable 
into a spatially dependent part and a time-dependent part. Assume we are 
solving for the jth energy eigenstate (the jth eigenstate of H), which we'll 
denote |E;). Again from Chapter 9, the time dependence of |E;) is 


|B; (t)) = e~**/*|2;(0)), (11.2) 
that is, the state simply picks up the overall phase iA 
But an overall phase doesn’t change a state at all.? As such, it may 
seem that time evolution is dead on arrival—at least for systems with time- 
independent Hamiltonians. But such is decidedly not the case, as we’ll now 
see. In the process, we’ll get an object lesson in the complex nature of the 
quantum state: the subject matter of Chapter 10. 


11.2 How Time Evolution Works 
11.2.1 Time Evolving a Quantum State 


From Eq. (11.2) we see that an eigenstate of H, an energy eigenstate, 
doesn’t change in time—it’s “stationary.” For this reason, eigenstates of H 
are also called stationary states. Equation (11.2) gives us the (physically 
meaningless) time evolution of a stationary state. But how do we time 
evolve other (i.e. non-stationary) states? 

Because H is a Hermitian operator—it corresponds to an observable, 
energy—its (normalized) eigenstates form a complete, orthonormal basis. 
Thus, we can expand any state of a system in terms of its energy eigen- 
states. Moreover, Eq. (11.2) applies to any eigenstate of H. Thus, Eq. 
(11.2) provides a prescription for time evolving not only individual sta- 
tionary states, but also superpositions of such states—and thus for time 
evolving any state of the system. 


2 Thus, the notation |Ej(t)) or |E£;(0)) is superfluous—just as well to simply 
write |E;). 
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Suppose, for example, that our initial state, denoted |,(0)), is a super- 
position of energy eigenstates: |x(0)) = >>, cx|Ex). Then the time-evolved 
state, |x(t)), is simply: 


Ix(t)) = XO eirt ep] ER). (11.3) 
k 


Here we started with |x(0)) already expanded in the eigenstates of H. We 
can obtain a general equation for |y(t)), applicable when |x(0)) is not so 
expanded, by working backwards from Eq. (11.3), 


Ix) = Soe PHP eg| Ey) = eS cy. Ex) = ex (0). (11.4) 
k k 


This distills down to 


Ix(t)) = 7+2 (0) = Calx(0)), (11.5) 
where I’ve defined the unitary time evolution operator: Uy = trl, 
Equation (11.5) is the general rule for time evolving an arbitrary initial 
quantum state |y(0)), if H + H(t). The simpler Eq. (11.3) still holds once 
the initial state has been expanded in the eigenstates of H. 

So how, in actual practice, is an arbitrary initial state such as |x(0)) 
time evolved? Basically, Eq. (11.4) is carried out in reverse, that is, 


[x(t)) = eW##/4|y (0)) = e HAHA YS oy | Ey) = Se e | Ey). (11.6) 
k k 


Despite this mathematical prescription, students sometimes have difficulty 
forming a coherent picture of quantum-mechanical time evolution. The 
following “algorithm” for time evolving a quantum state and extracting 
measurement probabilities for some observable is designed to provide such 
a picture. 


1. f Ê = Ê (t), solve the time-dependent Schrödinger equation directly. 

2. If H + H(t), then, 

(a) Solve the system’s time-independent Schrödinger equation, thus 
obtaining the eigenstates and eigenvalues of Å. 

(b) Expand the initial state in the eigenstates of H (from part (a)). 

(c) Operate on the expanded state with Û,; substitute the eigenvalues 
of H (from part (a)) as necessary. 

3. Solve the eigenvalue equation for the operator corresponding to the 
observable of interest. The eigenvalues are the possible measured 
values. 

4. Expand the time-evolved state in the eigenstates obtained in step 3. 

5. Complex square the expansion coefficients. This yields the probabili- 
ties corresponding to the observable’s possible measured values. 
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Mathematically, steps 4 and 5 amount to calculating |(¢;|W(t))|?, where 
|W(t)) is the state of steps 1 or 2, and |¢x) is the kth eigenstate obtained 
in step 3. 

This algorithm provides a concise, unified picture of the time evolution 
of quantum states, and subsequent determination of probabilities. Actually 
implementing it in any particular case may range from easy to dauntingly 
difficult. 


11.2.2 Unitarity and Phases Revisited 


In the example of Section 10.2.5, the unitary operator U, = exp(iS.A) 
acted on a spin 1/2 system in the state | + x). By slightly modifying this 
example we can illustrate quantum-mechanical time evolution concretely. 

If a spin 4 system is in a uniform magnetic field in the +z direction, 
then H = woz. For our purposes, it’s sufficient to set wọ = 1. Because 
H + H(t), the system can be time evolved using Û, = e~##t/h = e~#8=t/h, 

For the case at hand, we already know the solutions to the time- 
independent Schrödinger equation, and how to expand | + x) in the 
eigenstates of H (see Section 10.2.5). Denoting the time-evolved state 
|(t)), then, we have 


Ww) =e 4/9 4) = he) + e*/7|— 2). (11.7) 


As in Section 10.2.5, the probabilities for Sy are determined by trans- 
forming back to the x basis and collecting the coefficients of | + x) and 


|- x): 


WO =S Ehta) +1-2)) +S (/4+2)-|-2)) 


= (= + E) | +2) + (= — j | - 2) 
= cos(t/2)| + x) — isin(t/2)| — 2). (11.8) 


In the initial state, |+2), the probabilities for Sẹ were Prob(+h/2) = 1 and 
Prob(—h/2) = 0. After time evolution, they are: Prob(+h/2) = cos?(t/2) 
and Prob(—h/2) = sin?(t/2). 

In the second line of Eq. (11.8), note that both complex numbers com- 
prising the coefficient of |+ x) are of modulus Z, But because the numbers’ 
phases vary with t, the modulus of the coefficient itself—the numbers’ 
sum—ranges from 0 to 1. (The coefficient of |—2) exhibits similar behavior.) 

All of this constitutes a concrete example not only of time evolution, 
but also of the concepts developed in Chapter 10. The unitary operator Û; 
introduced relative phases into the initial state (in the z basis). This, of 
course, did not alter the probabilities of S4. But upon transforming back to 


3 See, for example, Townsend (1992), Section 4.3. 
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the x basis, |+ 2) and | — x) each acquired coefficients consisting of a sum 
of complex numbers, with relative phases that U; introduced in the z basis. 


11.3 Expectation Values 
11.3.1 Time Derivatives 


In Chapter 1, we posited three hypothetical worlds. In Worlds 1 and 2—the 
worlds of classical physics—the basic objects of interest were particles, and 
the position of a particle was perfectly well defined at all times. World 3 
was the world of quantum mechanics. There, a probability function was 
well defined, so the average position of a particle was well defined—but the 
position of a particle was an improper concept (in fact, the particle concept 
itself was ill defined). 

This indistinct nature isn’t limited to position—it’s a general feature 
of quantum-mechanical observables. That being the case, the value of a 
quantum observable is, in general, a meaningless concept. But the average 
value of an observable is a legitmate concept.* Therefore, although it’s 
meaningless to talk of the time dependence of a quantum observable, such 
as position, it’s quite legitimate to talk of the time dependence of its average 
value, that is, its expectation value. 

Recall that the average, or expectation, value of some quantity A, in 
some state |W), is: (A) = (W|A|W). To calculate the time derivative of (A) in 
the state |W), we treat each “piece” of (W|A|W) as a distinct mathematical 
object, and apply the product rule for derivatives:° 


dA) od. 
HA = 4 aija 
= (SE) aim) + co (3) jy) + (wi4l (22) 


= + ((upér lv) — (wl Amriwy) + (0| @ v) 
= + (w|i, All) +(X] (3) |W). (11.9) 


4 The indistinct nature of observables stems from the inherently probabilistic nature 
of quantum mechanics. For example, for a (non-Dirac delta) wavefunction, a number 
obviously can’t specify the position; but a number does specify the average value of 
position. 

5 For A # A(t), the usual case, |W) must be the time-evolved state. Otherwise, (A) 
obviously can’t change! 
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Note that the Schrodinger equation was used to obtain the third line. We 
are not so interested in calculational details, however, as in what Eq. (11.9) 
can tell us about quantum mechanics. 

Usually, A + A(t), so that 0A/At = 0. Then Eq. (11.9) becomes 


d(A ~ 4 
Me = (U|[H, Al|v). (11.10) 
Consider the case where |W) is an eigenstate of É (a stationary state), 
which we'll call |E;). Then H|E;) = E,|E;). Because H is a Hermitian 
operator, the energy eigenvalue, Æj, must be real. Thus, we have 


mar 


ae (E;|HA|E;) — (E;|AH|E;) 


E; ((E;|Â|E;) — (Ej|A|E;)) = 0. (11.11) 


This, of course, is in keeping with what we expect. Apart from a phys- 
ically meaningless phase, a stationary state does not change in time, so 
neither can any probabilities—and neither, therefore, can any expectation 
values. 


11.3.2 Constants of the Motion 


If [H, A] = 0 (identically), then the observable A is called a constant of the 
motion. This term was presumably carried over from classical mechanics, 
where we usually focus on actual physical motion. Very often in quantum 
mechanics, however, we are not concerned with physical motion, so the 
term constant of the motion is slightly unfortunate; a more accurate term 
would be constant of the time evolution. 

A constant of the motion is definitely not the same thing as a stationary 
state. It’s worth carefully developing the difference beween the two—first, 
because both are intrinsically important, and second, because doing so 
further elucidates important concepts. 

It’s easy to see the significance of a constant of the motion. From Eq. 
(11.9), if A 4 A(t), then [A A] = 0 implies that d(A) /dt = 0. That is, the 
expectation value of the particular observable A is constant in time for all 
states. 

This is distinct from a stationary state—an eigenstate of H. A station- 
ary state is unaltered by time evolution, so the probability distributions 
for all observables must be unaltered for this particular state. 

It’s interesting to make connection with the discussion of Chapter 10. 
If H + H(t), then U, = e~t”. Thus, if [H, A] = 0, then [(%, A] = 0 
also. From Section 10.2.4, this implies that the probability distribution for 
A cannot be altered under the action of Un, regardless of the state. This 
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conclusion—that the probability distribution for A is constant—is consis- 
tent with, but somewhat stronger than, that obtained in Eq. (11.9), which 
applies only to the expectation value of A. 

Moreover, the Schrödinger equation—which was used in obtaining 
Eq. (11.9)—applies only to time evolution, so Eq. (11.9) cannot be extended 
to other unitary operators. But because our result from Section 10.2.4 
depends only on the behavior of the phase, it holds for any unitary opera- 
tor. That is, if [7, A] = 0, then the action of U cannot alter the probability 
distribution for the observable A, and this holds not just for time evolution, 
but for any unitary operator. 


11.4 Energy-Time Uncertainty Relations 
11.4.1 Conceptual Basis 


In Chapter 7 we discussed the general form of the uncertainty relations 


(11.12) 


Here A and B are observables, and A and B are the corresponding 
Hermitian operators. If we substitute x and py for A and B, we obtain 


AzAp, > h/2, (11.13) 


the famous uncertainty relation for position and momentum. 
The so-called energy-time uncertainty relation, 


AtAE > h/2, (11.14) 


is a useful and important result, and it appears to be quite similar to Eq. 
(11.12). But there’s something curious here: although time appears fre- 
quently in quantum mechanics, such as in the time-dependent Schrédinger 
equation (Eq. (11.1)), a Hermitian operator corresponding to time does not 
appear. You won’t see such an operator—because there is none! Time is 
not a quantum-mechanical observable, but a parameter. Evidently, then, 
Eq. (11.12) cannot form the basis for Eq. (11.14). Moreover, the physical 
interpretation of Eq. (11.12) is quite different from that of Eq. (11.14). 
So, where did Eq. (11.14) come from, and what does it mean? To address 
these questions, first recall how the “standard” uncertainty relations of Eq. 
(11.12) are interpreted physically. Starting with a system in some particular 
quantum state, call it |Y}, a measurement of either A or B is performed. By 
repeating this process many times—each time starting with the state |W), 
and each time measuring either A or B—two sets of measurement results 
are acquired, one set for A, and another for B. The spread in values for 
the observables A and B—that is, AA and AB—will be reflected in these 
measurement results. In other words, AA and AB characterize, in some 
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sense, the degree to which |W) is a superposition of the eigenstates of A, 
and of B , respectively. 

In Eq. (11.14), the physical meaning of AF is much the same as the 
meaning of AA and AB in Eq. (11.12). That is, repeated measurements 
of the energy, E, made on a quantum system—each time in the same 
state—will reflect AF. And AE (like AA and AB above) characterizes 
the degree to which the state is a superposition of energy eigenstates. 

The key to interpreting Eq. (11.14) is At. Typically, At represents a 
characteristic time for some physical process. Often, it represents some 
time over which the quantum state changes “appreciably” —an imprecise, 
though qualitatively accurate, description. 

For the spin 1/2 system of Section 11.2.2, At may refer to a time interval 
over which the probabilities of the system could significantly change. For a 
nucleus or particle that decays, At may describe the lifetime of the state.® 
If we’re talking about wavefunctions, then At may characterize how long 
it takes a wavepacket to pass some point. 

So At has no unique meaning, and its interpretation is often not 
straightforward. Largely because of this, the energy-time uncertainty rela- 
tion has been a subject of discussion and debate for decades.” Still, we have 
a rough idea of what At is, and we also know what it is not: At is not the 
spread in a set of “time measurements.” 

Given the concepts of quantum-mechanical time evolution developed 
in this chapter, we can at least gain a qualitative understanding of Eq. 
(11.14). Consider first the limiting case of a stationary state—which, as 
we've seen, never changes. But a stationary state is just an energy eigen- 
state, and an energy eigenstate clearly exhibits no spread in values for 
energy measurements, that is, AZ = 0. In this case, then, AFE = 0 and 
At = 0.8 

If, by contrast, the system state is a superposition of energy eigenstates, 
then AF ¥ 0. Roughly speaking, if the system is “more superposed,” in 
the sense of being more “spread out” over the energy eigenstates, then AE 
will be greater. 

So, how does AF “influence” At? Just as there’s no unique mean- 
ing for At, there’s no unique answer to this question. Regardless, the 
qualitative conclusion is the same: the greater the spread in the energy 
probability distribution—that is, the “more pronounced” the superpo- 
sition of energy eigenstates—the shorter the time scale over which the 
system changes appreciably. The situation was well stated by Leslie 


6 Quantum mechanics is probabilistic by nature. The lifetime is a probabilistic mea- 
sure, defined to be that time over which the probability of not having decayed is 
=a 
em, 
T See, for example, Ballentine (1998), Section 12.3; Busch (2002). 
8 AE = 0 and At — œ tell us what we want to know about this limiting case; we are 
unconcerned with strictly (i.e. quantitatively) satisfying Eq. (11.14). 
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Ballentine: “There is no energy-time relation that is closely analogous to 
the well-known position-momentum [uncertainty] relation. However, there 
are several useful inequalities relating some measure of the width of the 
energy distribution to some aspect of the time dependence.”? The following 
example illustrates these ideas quantitatively. 


11.4.2 Spin i: An Example 


To make things more quantitative, we can extend the spin 1 /2 example 
of Section 11.2.2. There, H = S,, the initial state was | + x), and the 
time-evolved state was, 


WO) = S/F 4 2) = (e + 2) + e? z). (11.15) 


From Eq. (11.15), Prob (E = h/2) = Prob(E = —h/2) = 1/2. Equation 
(3.10) then yields 


Any = VF) =F =f} (B+ %)- (1-31) <4. a9 


In Section 11.2.2, transforming |4(t)) to the x basis led to 


Prob(Sz = h/2) = cos?(t/2), and Prob(.S, = —h/2) = sin? (t /2). 
(11.17) 


The state of Section 11.2.2 was a maximal superposition of the two 
energy eigenstates | + z} and | — z), in the sense that it contained equal 
amounts of each. Let’s consider an initial state which is dominated by only 
one of the energy eigenstates, for example, 


16(0)) = Fal +2) + 5l- 2): (11.18) 

The time-evolved state is 
|d(t)) =e /? Ae] + 2) + eB] — 2). (11.19) 
For this state, Prob(E = h/2) = 1/10 and Prob(E = —h/2) = 9/10. 


Equation (3.10) then yields AE, = 34/10. Transforming |¢(t)) to the x 
basis leads to: 


Prob( Sa = £h/2) = $ + 4 cos(t). (11.20) 


Over what time intervals do |y(#)) and |¢(t)) change appreciably? The 
answer depends on how we define “appreciable,” and that’s open to inter- 
pretation. To be definite, let’s take our definition of an appreciable change 


9 Ballentine (1998), p. 347. 
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to be a change in the probability of some observable of at least, say, .3 (30 
percent) from its maximum value. 

From Eqs. (11.17) and (11.20), we see that the maximum value of 
Prob(S; = h/2) is 1 for |¢(t)), and 0.8 for |¢(#)). These maximums occur at 
t = 0 (and also at other times). How much time elapses before these prob- 
abilities drop by 0.3 from their maximums? For |w(t)), Prob(S; = h/2) 
drops to 0.7 at t © 1.16. For |¢(t)), Prob(S, = h/2) drops to 0.5 at 
t = 1/2 1.57.10 

The uncertainties in energy and time for |w(t)) and |#(t)), then, are 


AEy = h/2, Aty 1.16 
AEs =3h/10, Atg © 1.57 (11.21) 


Thus, AF, > AE, and Aty < Atọ. Qualitatively, this is as expected: 
greater AF implies smaller At. Quantitatively, we find that 


AEyAty y 58h, AEyAts = ATH. (11.22) 


The first relation satisfies Eq. (11.14), while the second violates it only 
slightly. That’s not bad, considering our somewhat arbitrary definition 
of At. 

This example provides a case study of the energy-time uncertainty rela- 
tions, illustrating the interplay between the time scale over which a system 
changes and how “spread out” it is over its energy eigenstates. Moreover, 
it puts into practice much of what was discussed in this chapter. 

Finally, one topic is conspicuous by its absence from this chapter: the 
time evolution of wavefunctions. That lacuna will be remedied in the 
final chapter. There at last, with the fundamental structure of quantum 
mechanics now at the ready, I will endeavor to build a coherent picture of 
wavefunctions—including their time evolution. 


11.5 Problems 


1. Use Eq. (11.9) to show that the expectation value of an observable A 
does not change with time if the system is in an energy eigenstate, 
and A does not depend explicitly on time. Assume that A and H do 
not share eigenstates. (You may also assume that H 4 H(t).) 

2. Suppose the initial state of a spin 1 system is 


|Y(0)) = 4 (11,1) + 11,0) + [1,-1)). 
Take H = B82, where (7 is a constant. Find the system’s state at some 
arbitrary later time t. (Carry the calculation as far as possible with 


the given information.) 


10 For both states, similar arguments apply for the case of Sz = —h/2. 
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3. Take the initial state of a spin $ system to be: |W(0) ) = 4f +2) + 


|- z2) y The system is subject to a Hamiltonian operator that satisfies 


| +z) = E| 2), H + H(t). 


(i) Your goal is to determine Prob(S, = h/2) for the state |W(¢)). 
First outline, step by step, in words, how you plan to go about this. 
Then do it, including carrying out all indicated multiplications. 
(ii) Imagine you were really doing this—not just as an exercise, but 
for a research calculation. Then, if you’re careful (and if you aren’t, 
you have no business being in physics), you would check that your 
answer to Part (i) is reasonable. As one such check, state the range 
of values within which any acceptable probability must lie, and then 
show that your answer lies within that range for all t. 

4. A friend, taking quantum mechanics at another university, tells you 
the following. 


Our textbook showed that if you start with a free-particle 
Gaussian and let it time evolve, then a reasonable estimate 
of the time required for significant spreading is: T = ma? /h. 
As our professor said, if the mass, m, is 1 g, and the width, a, 
is .1 cm, then the spread time T is roughly 107°s. Obviously 
we don’t see macroscopic particles spread in the real world, 
and this explains why. 


You could dispute your friend’s statement based on (1) the fact that 
macroscopic objects are comprised of microscopic objects, and (2) the 
choice of mass is too large. Please provide two other clearly stated 
reasons why the student’s argument does not adequately explain the 
non-spreading of macroscopic particles. 

5. This problem helps illustrate the connection between quantum time 

evolution, altering the phases of quantum states, and changing 
probabilities. 
Suppose there exist two independent, orthonormal bases in which to 
represent some particular quantum state: the alpha basis, comprises 
a1) and jaz), and the beta basis, comprises |81) and |82}. Assume, 
also, that 


Ala; =ajlaj), BI Bj.) = brl8r). 


(i) Suppose the initial system state is |a,), and the time evolved the 
state simply changes by an overall phase factor, exp(—20t/h) (with 0 
a constant). 

(a) Show explicitly whether or not the probabilities for a, and ag are 
changed. 

(b) Show explicitly whether or not the probabilities for bı and bz are 
changed. 
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(ii) Now suppose the initial state is c,|a1) + cgla2), and the time 
evolved state is cı exp(—idit/h)|a1) + c2exp(—idet/h)|a2), with %1 
and @2 constants. 

(a) Show explicitly whether or not the probabilities for a and az are 
changed. 

(b) Show explicitly whether or not the probabilities for bı and bz are 
changed. 

6. The Hamiltonian for a spin 4 particle in a uniform magnetic field in 
the z direction is: H = woz. At time t = 0, the particle is in the state 
\(0)) = | + y). Note that |ty) = S5\+2) x 5l-2)- 

(i) Find |:(t)), the state at time t. Be sure to pull out an overall 
phase, to simplify your work in Part (ii). 

(ii) When is the system first in the state | — y) (after t = 0)? 

(iii) What is the probability that a measurement of Sy will result in 
+h/2 at time t? 

7. The state of a quantum-mechanical system at time t = 0 is |W). 
Suppose that Gly) = Yk|Yk). Suppose, further, that |W) = X; ali) 
(i) What is the probability distribution for the possible measurement 
values associated with G in the state |)? 
(ii) Suppose the (time-independent) Hamiltonian of the system is H = 
aF? (where a is a constant), and that F satisfies: F|8;) = b;|@;). If 
|W) =>, dn|Gn), find the state of the system at time t = T. 
(iii) Explain how you would obtain the probability distribution for 
the possible measurement values associated with G at time t = T? 

8. Sometimes the energy-time uncertainty relation appears in the form: 


AA h 
AE (cam) a (11.23) 


Suppose that neither H nor A are explicitly time dependent, and 
consider the following cases. 

(i) The system is in an eigenstate of H, and [H, 4) =0. 
(ii) The system is in an eigenstate of H, and [H, A] #0. 

(iii) The system is not in an eigenstate of H, and [H, Al =0. _ 

(iv) The system is not in an eigenstate of either H or A, and 
[H, A] £0. 

(v) The system is not in an eigenstate of H, but (at the time of each 
measurement) is in an eigenstate of A, and [H, A] 4 0. 

Qualitatively (and briefly) describe AF, AA, and |d(A)/d¢| for all 
five cases. Then briefly argue whether each case can be qualitatively 
reconciled with Eq. (11.23). That is, for each case, can the left-hand 


side of Eq. (11.23) be greater than zero? 
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We thus find that in order to describe the properties of Matter, as well as those of 
Light, we must employ waves and corpuscles simultaneously. We can no longer 
imagine the electron as being just a minute corpuscle of electricity: we must asso- 
ciate a wave with it. And this wave is not just a fiction: its length can be measured 
and its interferences calculated in advance. 


Louis de Broglie! 


1905 was Albert Einstein’s annus mirabilis: his miracle year. In that 
one year, young Einstein published five landmark papers. Less appre- 
ciated, perhaps, is Erwin Schrödinger’s astounding scientific feat in the 
year 1926. In that one year, Schrödinger published six important papers. 
Four of these—submitted to Annalen der Physik within a span of only 
six months—together constitute Schrodinger’s landmark formulation of 
quantum mechanics in terms of a wave equation: wave mechanics.” 

As a practical tool, wave mechanics largely supplanted Heisenberg’s 
matrix mechanics. It was wave mechanics that utilized the familiar 
mathematics of differential equations. And it was wave mechanics that 
presented at least some kind of picture to aid in one’s thinking about 
quantum physics. Wave mechanics became the tool of choice both for 
solving many quantum-mechanical problems and for expositions of the 
subject. 

The approach in this book has been different—although wavefunctions 
have been discussed,’ they’ve played a decidedly secondary role. The pri- 
mary goal has been to build a firm foundation in the fundamentals of 
quantum mechanics. Now that that foundation is in place, we can use 
it in a systematic, if brief, exposition of wave mechanics. In so doing, 
we will uncover the correspondence between the quantum mechanics of 
wavefunctions and that of discrete eigenstates. 


1 Louis de Broglie, Nobel Prize Address, 1929; published in Boorse (1966), p. 1059. 

2 Use of the term “wave mechanics” has gradually declined. The term “quantum 
mechanics” includes, of course, both wave and matrix mechanics. 

3 See, in particular, Chapters 2, 3, 9, and 10. 
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12.1 What is a Wavefunction? 
12.1.1 Eigenstates and Coefficients 


At first glance, it may appear that the quantum mechanics of wavefunctions 
bears little resemblance to, say, that of a spin j system. 

A spin Z state is (at most) a superposition of just two basis states, such 
as | + z) and | — z); for example: 


la) = c| + 2} + c-| — 2). (12.1) 


A wavefunction, however, is just what its name implies: a continuous func- 
tion. One possible wavefunction, for example, is a Gaussian centered at 
ao: U(x) = Aexp |[—B(x — x£o)?] (with A and B constants). These two 
quantum states, |a) and U(x), look very different. What connections can 
we uncover between them? 

An obvious discrepancy is that bra-ket notation is used in the spin 4 
case, but not for the wavefunction U(x). Bra-ket notation can be used for 
handling wavefunctions, but it’s a tricky business—one best deferred until 
we better understand wavefunctions themselves. 

For a spin l system, a basis consists of two states—one correspond- 
ing to each possible measured value of the observable asociated with a 
Hermitian operator. For example, | + z) and | — z} correspond to h/2 
and —h/2, respectively, the two possible measured values of S, (the 
observable associated with S.). Obviously, the basis states are discrete 
(countable). 

Similarly, for a wavefunction there must be a position basis state corre- 
sponding to each possible measured value of position, each x (the observable 
associated with #). But because the positions, the xs, are continuous, 
an infinite set of position basis states is required to form the position 
basis. 

What about expansion coefficients? For the spin 4 state |a}, the coef- 
ficients are the complex numbers c+ and c_. Their complex squares, |c+|? 
and |c_|?, are the probabilities of finding the system to be in |+2) or |— 2), 
respectively, upon measurement. If |œ) is properly normalized, the sum of 
these probabilities must equal the total probability, 1. 

But where are the expansion coefficients in a wavefunction? The 
answer is: the set of all expansion coefficients is the wavefunction. Adding 
the squares of the expansion coefficients for a system with discrete eigen- 
states, such as the spin 4 state |a), is analogous to integrating over the 
squares of the expansion coefficients in a state with continuous eigen- 
values, such as the wavefunction W(«).° If U(x) is properly normalized, 


4 Thus, when dealing with wavefunctions, the word function is often substituted for 
state. 

5 Consider calculating the total mass in some region of space. If the mass in the 
region is comprised of discrete objects, we simply sum their individual masses. But if 


What is a Wavefunction? 159 


integrating |W(a)|? over the entire wavefunction must equal the total 
probability, 1. 

A spin 4 basis state such as | + z) can, of course, be written in any spin 
3 basis. Written in the z basis (its own eigenbasis), the state | + z) has 
only one non-zero coefficient. Written in the x basis, however, | + z} is the 
superposition Zl +2) 4 zl x); both coefficients are now non-zero. 

The position basis states, too, can be written in different representations 
(we'll see how in Section 12.2.2). If, however, we write a position basis state 
in the position basis (its own eigenbasis), it’s a Dirac delta. For example, 
ô(x— xo) is the position basis state corresponding to xo, written in position 
representation. This makes sense: if the spin $ state | + z) is written in its 
own eigenbasis, there is only one non-zero coefficient. If a position basis 
state is written in its own eigenbasis, it’s a Dirac delta, and is non-zero for 
only one value of x, that is, only one coefficient is non-zero. 


12.1.2 Representations and Operators 


Perhaps the term “wavefunction” suggests to you a function of position. 
Clearly, the wavefunction U(x) is a function of position, but that’s because 
we’ve chosen a specific representation: the position representation. And a 
wavefunction—like a spin $ state—can be written in more than one repre- 
sentation. For wavefunctions, there are typically only two representations 
of interest: the position representation and the momentum representation. 

Before considering the momentum representation of a wavefunction, 
let’s briefly return to the spin $ case. The states | + z) are eigenstates of 


the operator ô.. T hey satisfy the eigenvalue equation: 


§.|£2) =+2|+2), (12.2) 


again illustrating that if an operator acts on one of its eigenstates, it may 
be replaced with the corresponding eigenvalue. If an operator acts on a 
non-eigenstate, this no longer holds, but we may expand the state in the 
operator’s eigenstates and then act on it. For example, for the state |a) of 
Eq. (12.1): 


s A h 
S.|a) = 8.(c4| +2) +e - 2)) = 7 


These are familiar ideas. How do they apply to wavefunctions? 


the mass is continuously distributed, we must integrate the mass density (of dimensions 
mass/length®). Similarly, to obtain the total probability, |c+|? and |c_|? are simply 
added, but |(x)|? is a probability density (of dimensions probability /length), which 
must be integrated. 
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For a wavefunction written in position representation, such as (x), the 
basis is formed by the position eigenstates. These satisfy 


êlre) = ax), (12.4) 


where ĉ is the position operator. Equation (12.4) is analogous to Eq. (12.2). 
In both equations, an operator acting on its eigenstates is replaced with 
the corresponding eigenvalue. But, just as Eq. (12.2) does not mean that 
S., an operator, equals +8, Eq. (12.4) does not mean that ĉ, an operator, 
equals x. 

For a wavefunction in position representation, the (linear) momentum 
operator is: py = ing. As we saw in Chapter 7, ĉ and p, do not commute, 
so they do not share eigenstates.® 

What, then, are the eigenstates of pp? We can answer this by solving a 
simple differential equation 


Px f(x) = Te =p] (x), (12.5) 


where p, is the momentum eigenvalue (a number). It’s easy to see that 
Eq. (12.5) is satisfied by f(x) = N exp(ikx), where N is a normalization 
constant. Thus, N exp(i«) is the eigenstate of py, the linear momentum 
operator, with eigenvalue py = ik. 

Good enough; but to really understand f(a) demands careful considera- 
tion! Let’s first describe f(x) in words: f(x) is the eigenstate of momentum 
corresponding to the eigenvalue py, written in the position representation. 
The position probability distribution for f(a) is: |f (x)|? = |N exp(ikx)|? = 
|N|?. That is, all positions are equally probable for the momentum eigen- 
state f(x). For a position eigenstate, however, only one, unique x value has 
a non-zero probability. In this sense, position and momentum eigenstates 
are as unlike each other as possible.” 

It might at first seem strange that a state could describe a position 
probability distribution spread over the entire x axis. But consider a more 
familiar case. 

The state |+z) is an eigenstate of ĝ,, which is reflected in the fact that, 
in the z basis, |+-z) (obviously) has only a single non-zero term: |+2) itself. 
And, as an eigenstate of $., |+z) corresponds to a unique S, value, h/2. 

But in, say, the x basis, |+z) is a superposition, that is, it’s “spread 
out” over the basis states |+a) and |—«). And, of course, it must be spread 
out: if |+z) could be represented by just one x basis state, then there 
would be only one possible measured value for Sẹ on |+z). But then |+z) 


6 Note that p,WV(«x) = —ihdW/dx creates a new function, associating a number 
with each x. Only if each such number depended solely on the corresponding posi- 
tion eigenstate—not, as is the case, on U(x)—could the position eigenstates also be 
eigenstates of jp, (compare ĉ). 

T Because f(x) extends over the infinite x axis, it’s non-normalizable. As discussed in 
many quantum texts, however, N is often assigned a value for calculational purposes. 
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would be an eigenstate of Sz, which it is not. Similarly, a momentum eigen- 
state must be spread out in the position representation, otherwise it would 
be a position eigenstate. 


12.2 Changing Representations 
12.2.1 Change of Basis Revisited 


As discussed in Section 10.3, a wavefunction may be expressed in either 
position or momentum representation, for example, as (x,t) or (p, t). 
But how does one actually transform between the two representations? 

To answer this question, let’s revisit non-wavefunction states. Consider 
a state |T) that may be expanded in either the |a,;) or |by,) basis states 
(where j,k = 1,2,...,n): 


IT) = $ /(ajlP)|aj) = (a |E})la1) +--+ + (@nIP)lan), (12.6) 
IF) = So (be IP) Ibe) = (br IP) |b) + -++ + (bn IT) lbn). (12.7) 
k=1 


Now suppose we wish to transform |I} from the a basis to the b basis. 
Conceptually, we know that each of the |b;,)s will, in general, be composed 
of contributions from all of the |a;)s.* Thus, the |b1) “piece” of |T} will in 
general consist of contributions from all n of the |a;)s. 

Formally, we could carry out the transformation by acting on |I}, writ- 
ten in the a basis, with the identity operator (the sum of all projection 
operators), written in the b basis 


FID) = So (br) (bel) = $ lbr) (bel X (asl) a5) 
k=1 k=1 j=l 


n 


= X_ (aj|P) (belay) |be). (12.8) 


j,k=1 


By considering only the k = 1 term in Eq. (12.8), we again see that the |b;) 
piece of |T} consists, in general, of contributions from all n of the |a;)s. 


12.2.2 From z to p and Back Again 


How is this relevant to wavefunctions? The last expression in Eq. (12.8) is, 
in essence, a “recipe” for transforming a state |T} from the a basis to the 
b basis. Using this recipe as a guide, we can construct a transformation to 
take a position-space wavefunction, (x), into momentum space. 


8 Similarly, suppose ĉ, ĝ, 2, and #’, ĝ', 2’ define two Cartesian coordinate systems. 
Then the vector #’, for example, consists in general of contributions from all three 
unprimed unit vectors. 
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To see how, let’s set k = 1 (as suggested above) and j = 2 in the last 
term of Eq. (12.8); this yields: (a2|T) (bi|a2)|b1). The expansion coefficient 
(a2|[) is how much |a2)—one of the “old” basis states—there is in |I}. 
The other inner product, (b;|a2), is how much |b;)—one of the “new” basis 
states—there is in |az). Therefore, (a2|I)(bi|a2) is how much |b;) is in |T) 
due to its |az) piece. The total amount of |b) in |I} includes, in general, 
contributions from all n of the |a;)s—this is why the summation over j 
appears in Eq. (12.8). 

Now construct the transformation of y(x) into momentum space. The 
position eigenstates are now the “old” basis states; the momentum eigen- 
states, the “new” basis states. In position representation, these states are 
6(a — xo) and Ne’?0*/", respectively (with zo and po possible position and 
momentum values). 

The expansion coefficient projecting |I} onto |az) was, of course, an 
inner product. Similarly, the expansion coefficient projecting ~ onto a 
position basis state, such as (x — xq), is an inner product; explicitly? 


(role) = f ce Or cn (12.9) 


Corresponding to the inner product (bi|a2) is the inner product of the 
position eigenstate, ô(x — zo), with the momentum eigenstate,!° Ne‘?ot/”: 


(po|to) = n f eT Por/hS(¢ — gojde = NeT? (12.10) 


In Eq. (12.8), obtaining the total contribution to one of the |bẹ}s 
required summing over the old basis states (over j). Similarly, the total 
contribution to one momentum eigenstate requires integrating over all the 
old basis states (over dx). Putting all this together, we obtain the following 
correspondence:!! 


n 


S"(aj|T)(bilaj)|b1) — nf W(a)e~ Por! dar, (12.11) 


j=l 


We are not yet done, though. Equation (12.11) accounts for the single 
basis state |b;) on the left, and the single momentum basis state Ne‘?0*/? 


9 Note that Eq. (12.9) could be interpreted as explicit confirmation that the numbers 
comprising a wavefunction are indeed expansion coefficients. 
10 Here the normalization factor, N, is taken to be real. 
11 Nothing corresponding to the ket |b) appears on the right-hand side of Eq. (12.11) 
because a wavefunction, such as w(x), is a set of numbers, not numbers and kets. See 
Section 12.4.1. 
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on the right. Thus, the integral in Eq. (12.11) does not give us the entire 
state ¢(p), but only ¢(po). To remedy this, we simply replace po with p, 
that is, we replace the number po with the variable p (note that this cannot 
affect the integration). The transformation of y(x) into momentum space 
is, therefore, 


(p) = nf pleje? dg. (12.12) 


We obtained Eq. (12.12) by developing an analogy with Eq. (12.8), so as 
to illustrate the parallels between the wavefunction case and the discrete 
eigenvalue case. In fact, however, Eq. (12.12) is a well known and reliable 
workhorse of applied mathematics: the Fourier transform. 

As a simple example, let’s transform a position eigenstate (x — £o) into 
momentum space. From Eq. (12.12) we have: 


zo (p) = n f lx — xoje Pr dx = Newipeo/h. (12.13) 


Evidently the form of a position eigenstate in momentum space, Ne~‘?*/", 
is similar to that of a momentum eigenstate in position space, Ne*?o?/", 
The two differ by a sign change in the exponential, and by the fact that 
for a position eigenstate, momentum is a variable and position a number 
(the position eigenvalue), while for a momentum eigenstate, position is a 
variable and momentum a number (the momentum eigenvalue). 

Transforming a momentum eigenstate into momentum space isn’t so 
easy (try it). Nevertheless, it’s not hard to see that a momentum eigenstate 
in momentum space must be a Dirac delta, that is, 6(p — po), in analogy 
with a position eigenstate in position space: d(x — xo). First, we’re dealing 
with a continuum of eigenvalues, and—just as for position—the Dirac delta 
is the tool to pick out just one. 

Moreover, for momentum eigenstates of the form 46(p — po), an analysis 
like that above leads to a transformation from momentum space to position 
space that’s very similar to Eq. (12.12) (another Fourier transform). So if 
we transform a Dirac-delta momentum eigenstate into position space (much 
like Eq. (12.13)), we clearly will obtain the proper momentum eigenstate 
in position space. 


12.2.3 Gaussians and Beyond 


Analytically evaluating the integral in Eq. 
impossible, depending on the form of y(x 


12.12) can range from easy to 
. An important yet tractable 


Ww a 


example is that of a Gaussian, such as U(x) = Ae~®*” in Section 12.1.1 
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(taking zo = 0 for simplicity). For this case, Eq. (12.12) becomes 
pasaj ce Peele A | ate A ay /48h a 
=e P/ABR i. e Ble +ip/2BR) gy — \/B eR ABR, (12.14) 


The integration over dx—carried out by completing the square in the 
argument of the exponential!?—results in a function of p. Moreover, this 
momentum-space function is a Gaussian, like its position-space counterpart 

This well-known result—that a Gaussian in position space transforms 
to another Gaussian in momentum space—displays a deeply pleasing sym- 
metry. That’s even more true if we consider the two Gaussians’ widths. The 
width of a Gaussian is conveniently defined as the separation between those 
points where its value falls to 1/e of its maximum. For U(x) = Ae~*”, this 
occurs at x = + B~'/?, so the width is 2B~!/?. Similarly, from Eq. (12.14), 
the width of ¢(p) is 4hB'/?. Thus, the two Gaussians’ widths are inversely 
proportional. 

As the spread in | (x)|?, the position probability distribution, increases, 
the spread in |¢(p)|?, the momentum probability distribution, decreases in 
inverse proportion, and vice versa. This behavior provides a visualizable, 
and often used, illustration of the uncertainty relation for position and 
momentum: ApAx > Ë (see Fig. 12.1). 

Useful as it is, this illustration can impart an artificially simple picture 
of wavefunction behavior. Suppose, for example, that we multiply U(«) by 


(a) (b) 
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Fig. 12.1 The widths of Gaussian (a) position and (b) momentum proba- 
bility distributions are inversely proportional. Solid lines correspond to one 
state; dashed lines to another. 


12 One must also evaluate a deceptively simple Gaussian integral, of the form 
2 
a e72? dx. These are discussed in many thermal/statistical physics texts, as well 
as some quantum texts, for example, Townsend (1992). 
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a phase factor e7, with y a real constant. This, of course, doesn’t alter the 
position probability distribution, but if we now calculate the momentum- 
space wavefunction, as in Eq. (12.14), we obtain a modified ¢(p). For this 
particular phase factor, ¢(p) will still be a Gaussian (though shifted), but 
one could imagine countless other phase factors which transform into non- 
Gaussian momentum-space wavefunctions. 

There is, in addition, an infinite variety of possible non-Gaussian mod- 
uli. The very nice symmetry that exists for a real Gaussian, as embodied 
in Eq. (12.14), is the exception rather than the rule. 


12.3 Phases and Time Evolution 
12.3.1 Free Particle Evolution 


The time evolution of wavefunctions plays an important role in quantum 
mechanics. It also provides a means to elucidate the roles of representations 
and phases in the context of wavefunctions. 

Multiplication of a wavefunction 7)(a,t) by an overall phase factor et 
(where A is a real constant) results in 


A 


p(x, t) = R(x, the") — R(x, tee), (12.15) 


This leaves the position and momentum probability distributions 
unchanged. By contrast, multiplication by the relative phase e’* ) yields 


plx, t) = R(x, the) — R(x, the* eS), (12.16) 


which does not alter the position probability distribution, but does alter the 
momentum probability distribution. The situation is analogous for ¢(p, t), 
the momentum-space wavefunction corresponding to (a, t), in which case 
a relative phase would be a function of p. All this was discussed in Section 
10.3 where, in addition, we saw that this behavior of wavefunctions with 
respect to phase changes is entirely consistent with what occurs in the 
discrete eigenstate case. 

A free particle provides the simplest example of wavefunction time evo- 


lution. The classical Hamiltonian, denoted Hy, is now simply the kinetic 
a2 


energy: Hy = a The corresponding Hamiltonian operator is H f=. 
As yet, no representation has been chosen. In position space, p = ihe, 
so that H f= ma di Given this, the position-space time evolution 
operator is 
Pa iH et 2 
Üle) = exp (4) = exp (H 6). (12.17) 


An exponential with a in its argument—what do we do with this thing? 
We know that the position eigenstates are not eigenstates of p, so they’re 

a a2 A 
also not eigenstates of Hy = £-. And if they’re not eigenstates of Hy, 
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they’re not eigenstates of h = e*Ast/h Tf an operator does not act on 
its eigenstates, it can’t be replaced by its eigenvalues. So in position space, 
Hy can’t be replaced by its eigenvalues. 
Clearly, however, the eigenstates of p are eigenstates of ÎĤ f= 2, and 
m 


therefore of U, = e~*#t/", In momentum representation, then, the oper- 
ator p can be replaced with its eigenvalues: p —> p. But then p? in U (t), 
also, can be replaced (with p°). 

This suggests time evolving the state in the momentum representation. 


For an initial state (p, 0), the time-evolved state d(p,t) is 


(pt) = O(1)6(p,0) = exp (—4#*) o(p,0) 
= exp (52) (p, 0) = exp (52) $l(p,0). (12.18) 


Mathematically, the time evolution of ¢(p,0) has been reduced to simply 
multiplying together two functions of p. 

Time evolution of a wavefunction, as embodied in Eq. (12.18), is 
quite analogous to the time evolution of non-wavefunction states, as in 
Chapter 11. There, an initial state was expanded in the eigenstates of U (t), 
whose action was then simply multiplication of each coefficient by a (time- 
dependent) phase factor. This is precisely what happens in Eq. (12.18) for 
a wavefunction. 

Note that, in Eq. (12.18), ¢(p, 0) is multiplied by a relative phase factor: 
one that’s different for different values of p. And, as expected, this doesn’t 
change the momentum probabilities: 


: 2 
|$(p,0)|” = [exp (35) 6(7,0)| = [6D] 
It does, however, alter the position probabilities—again, as expected. To 
see this, we need to consider how one actually transforms a state from the 
position representation to the momentum representation and back. 

Figure 12.2 (a) illustrates the identical position probability distribu- 
tions for two states (wavefunctions) with identical Gaussian moduli (in 
position space). One of these states has no phase. The other has a phase 


(12.19) 


factor of the form exp i { -32 sin(4x) | . As a result, the two states’ 


momentum probability distributions vary greatly (see Fig. 12.2 (b)). One 
expects that, because of these different momentum probability distribu- 
tions, the states will time evolve differently, despite their initially identical 
position probability distributions. 

Again (as in Section 12.2.3), certain dangers are inherent in considering 
only Gaussians. Consider first a classical free particle with initial velocity 
zero and initial position x9. The time-evolved system is x(t) = zo: the par- 
ticle remains at xo forever. If any physical system exhibits trivial behavior, 
surely this is it. 
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(a) (b) 


Probability 
Probability 


Position (x) Momentum (p) 


Fig. 12.2 (a) Two states with moduli of the form e* (Gaussians) have 
identical position probability distributions. (b) But if their phases differ, 
their momentum probability distributions can differ. Here, one has no 
phase (dashed line), but the other (dotted line) has a phase of the form 
ei{ 2? /2—3a/2—sin(4x)} 


Yet the corresponding quantum-mechanical problem is anything but 
trivial. There is an infinite variety of possible initial states with (x) = xo 
and (p,) = 0. Hardly any of these can be time evolved analytically. The 
notable exception, once again, is a real Gaussian. It is not hyperbole to say 
that analytically solving the general free-particle problem is trivially easy 
in classical mechanics, and impossibly difficult in quantum mechanics. 


12.3.2 Wavepackets 


Chapter 9 focused largely on energy eigenstates in position space—a par- 
ticularly important type of wavefunction. But these comprise only a subset 
of all wavefunctions, and focusing on them can obscure the fact that, in 
quantum mechanics, particles are typically best represented not as energy 
eigenstates, but as wavepackets—spatially localized wavefunctions. Section 
9.4 considered scattering from step and rectangular potential barriers using 
energy eigenstates. But the “real” problem involves particles, represented 
by wavepackets. 

One of the more puzzling results of Section 9.4 concerned the step poten- 
tial for the case E < Vo, that is, when the particle energy was less than 
that of the potential barrier. In that case the probability decreased expo- 
nentially inside the barrier, but the physical interpretation of that fact was 
unclear. 

Time evolving a wavepacket incident upon the barrier helps clarify the 
physics. As for the free particle case, an infinite variety of initial wavepack- 
ets is possible. Computer-generated results for an initial Gaussian show 
that, upon interacting with the barrier, the packet gradually splits up. One 
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piece rapidly forms a reflected packet. But some of the wavefunction pene- 
trates into the barrier, reverses direction, and eventually forms a delayed, 
reflected packet. 

This provides us with a somewhat more picturable situation. Rather 
than a wave of infinite extent outside the barrier, and exponential 
decrease within it—all time independent—the wavepacket exhibits time- 
dependent behavior that’s at least particle-like. A wavepacket treatment 
also lends additional insight into the other cases in Section 9.4, including 
tunneling. 13 

Many potentials are far more complicated than the step potential. The 
interaction of particles with such potentials is properly represented, and 
understood, not through energy eigenstates, but through the time evolution 
of wavepackets. 


12.4 Bra-ket Notation 
12.4.1 Quantum States 


Because bra-ket notation can be subtle in wave mechanics, I’ve largely 
avoided its use in this chapter so far. Now, with the basic concepts of 
wavefunctions in hand, it’s time to examine bra-ket notation in the context 
of wavefunctions. Pll do so by again comparing the discrete basis state case 
to wavefunctions. 

In either case—discrete basis states or wavefunctions—basis states may 
be written as kets. For example, suppose that {|a,)}, with j = 1,2,...,m, 
constitutes a set of discrete basis states. A particular ket, such as |a3), is 
simply the third basis state in the set. 

Corresponding to some position £p is the position basis state |x}. But 
these basis states are continuously distributed along the x axis, so it makes 
no sense to talk of, say, “the third position ket”. And because x is a con- 
tinuous variable, the ket |x} (without subscript) is taken to represent the 
complete, infinite set of all position kets, all position basis states. The 
counterpart of |x), then, is not some state |a;), but the complete set {|a;)}. 

Now suppose that |y} is a state in the space spanned by the set {|a;)}. 
We can take the ket |y} to denote the state abstractly, that is, without 
adopting any specific representation (basis) for the state. Of course, we 
could choose to expand |y) in, say, the a representation, 


ly) = So gla) = Sfagly) lay). (12.20) 


j j 
The state now appears as a superposition, a weighted sum, of kets. 


13 Computer-generated results may be found in, for example, Goldberg (1967); Schiff 
(1968), pp. 106-107. 
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By contrast, in matrix mechanics a state appears as a set (not a sum) 
of numbers (not kets). For example, |y) would be represented as a col- 
umn vector, the elements of which are expansion coefficients; if we work 
in the a basis, these elements are the (a;|7)s. The kets themselves do not 
appear—which is why, to properly interpret a column vector, we must know 
what basis we’re working in. 

A wavefunction, also, may be written as an abstract ket, a superposition 
of basis kets, or a set of numbers (expansion coefficients). This fact is 
key to developing a correct understanding of Dirac notation as applied to 
wavefunctions. 

Take the abstract (representation-free) wavefunction of a system to be 
w, or in bra-ket notation, |w). If this state is expanded in position space, it 
can be written either as a superposition of kets or as a set of numbers. 

In conventional notation, if w is expanded in position space and written 
as a set of numbers, the result is y(x), that is, a function of x.!* In bra-ket 
notation, this set of numbers appears as (a|w), so that 


(a) = (lw). (12.21) 


Simple though it is, Eq. (12.21) warrants careful interpretation. Really, Eq. 
(12.21) is more of a translation between conventional and bra-ket notation 
than an equation in the usual sense. Moreover, “the” inner product (|) is 
in fact an infinite set of inner products, since |x}, and thus also (x|, denotes 
the complete set of position basis states. That’s why what looks like a single 
inner product, a single number, is in fact the entire function y(x). 
Instead, our wavefunction could be written as a superposition of kets, 


w= f alwaar. (12.22) 


—0O 


Given the replacement of the summation in Eq. (12.20) with an integral in 
Eq. (12.22), the correspondence between the two equations seems straight- 
forward. Note that both are “dimensionally consistent,” in the sense that 
both sides of each equation consist of kets. Still, interpretation of Eq. 
(12.22) is quite subtle. 

Because |x} denotes the complete set of position basis states, then as 
just discussed, (a|w) is the complete set of all such inner products. But if 
all inner products are already accounted for, and indeed, if y(x) already 
appears in the integrand (as (2|q)), why must we integrate over dx? And 
what can such an integration even mean? 


x oat 2, š s 
14 A wavefunction, such as Ae~?*”, is effectively a set of numbers (a number is 


associated with each x value), and thus analogous to a column vector. However, the 
representation is now obvious: a function of x is in position space; a function of p, in 
momentum space. 
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Let’s return to Eq. (12.20). Suppose we calculate all the terms—all the 
weighted kets of the form (a;|y)|a;)—in this equation. This set of weighted 
kets is not the state |y). Only when a superposition is formed—when 
the terms are added up—do we obtain |y). The integrand in Eq. (12.22), 
(x|)|x), corresponds to the set of (a;|y)|a,;)s. And just as the (a;|7)|a,;)s 
must be added to obtain |7), so must the (2|w)|x)s be integrated to 
obtain |Y}. 

In physics, the careless use of notation often leads to trouble, and when 
Dirac notation is applied to wavefunctions, the pitfalls can be treacherous. 
One might think, for example, that because our wavefunction could be writ- 
ten as either |Y) or (a|wW), we can write: |) = (a|w). But this is nonsense 
(and it even looks wrong). Both |w) and (x|y) are legitimate expressions 
for our wavefunction, but within different frameworks: a ket can’t equal a 
set of numbers. 

One more comment about notation: to denote the same state in different 
representations, I’ve used different symbols, such as y(x) and ¢(p). This 
convention makes clear that y(x) and ¢(p) are different functions, but it 
obscures the fact that they represent the same state. 

Instead, the same symbol could be used for both, such as u(x) and 
(p). This convention makes clear that y(x) and ~(p) represent the same 
state, but it obscures the fact that they are different functions. 

In bra-ket notation, the position and momentum representations of 
the state |w) are (z|) and (plz), respectively. By contrast, (x|) and 
(p|¢) would normally represent two different states, |Y) and |@), writ- 
ten in position and momentum space, respectively. This suggests that the 
same-symbol convention is more in accord with bra-ket notation than the 
different-symbol convention. Ultimately, either is acceptable, so long as it’s 
understood what is meant. 


12.4.2 Ejigenstates and Transformations 


Both |w) and (a|w) represent the same quantum state: one as an abstract 
ket, the other as a function in position space. Similar considerations apply 
to eigenstates of ĉ or Ð. 

The kets |x) and |p) are the complete sets of abstract position and 
momentum eigenstates, while |xo), for example, is one such state. However, 
(x|xo} is the position-space representation of this state: (x|xo) = 6(a— xo). 
By contrast, (p|zo) is the state’s momentum-space representation: (p|xo} = 
Neipto/h. 

Because of its inherent importance, and because it provides an oppor- 
tunity to explore Dirac notation, the Fourier transform (see Section 12.2.2) 
is worth re-examining. Using y(x) = (s|) (Eq. (12.21)) and (p|ao) = 
Ne’P*0/" we can construct the transform from position to momentum space 
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in Dirac notation by direct translation from Eq. (12.12):'° 


=N f pae da f (ele) oleae 


= f” Wia)telwyae. (12.28) 


—oo 


From Eq. (12.11), we can construct a correspondence between coeffi- 
cients, 


Loloa — f (polz) (ald) dx (12.24) 


j=1 


The left side is the coefficient of the basis state |b,), while the right side 
is the coefficient of the basis state |po) (i.e., it’s @(po)). Here (a;|P) corre- 
sponds to (a|~): both are the projections of the respective system states 
onto the sets of old basis states. In addition, (b;|a;) corresponds to (po|z): 
both are the projections of the sets of old basis states onto one of the new 
basis states. 

The study of wavefunctions constitutes a major part of quantum 
physics, and we’ve only scratched the surface. Nothing, for example, has 
been said about wavefunctions for non-free particles: particles subject to 
forces. But the goal of this chapter, and of this book, has been to develop the 
basic conceptual and mathematical tools and ideas that underpin the fun- 
damental structure of quantum mechanics. In that, I hope I have achieved 
some measure of success. 


12.5 Epilogue 


Our discussion of quantum mechanics now comes to a close. If there is one 
overarching lesson, it might be this: quantum mechanics is fundamentally 
about calculating the possible values of physical measurements, and the 
probabilities of obtaining those values; the quantum state is the entity 
wherein these probabilities reside. Moreover, while thoughtful people of 
good will can disagree about the metaphysical meaning of probabilities 
in quantum mechanics, it’s clear how those probabilities are manifested 
physically: as the statistical results of an ensemble of measurements. 

The goal of this book—to develop the tools and ideas necessary to think 
clearly about quantum mechanics—was relatively modest. Still, you may be 


15 Note that [°° |x)(x|dx is the identity operator of Section 5.3.2 for position basis 
states, so the last expression in Eq. (12.23) is really (p| Îl). 
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left with many questions—both mathematical and conceptual. Take com- 
fort in the fact that you’re in good company. As Niels Bohr famously said:16 
“Anyone who is not shocked by quantum theory has not understood it.” 

In Hinduism, Brahman is the ultimate reality—the great, ineffable spir- 
itual principle of the cosmos. In the Vedas, the ancient textual sources of 
Hinduism, the indescribability of Brahman is a recurrent theme. So when 
the disciple seeks to compare Brahman to the things of this world, he is 
rebuffed: “Not this, not this,” he is told. Disconcerting, perhaps—but then 
why should we expect anything less, anything easily comprehensible, of 
ultimate reality? 

Quantum mechanics is the greatest, the most profound of revolutions in 
our modern view of the physical world. Even for experts, acheiving a deep 
conceptual understanding of quantum mechanics can be an elusive goal. 
Quantum mechanics remains stubbornly recondite, notoriously abstract. 
Disconcerting, perhaps—but then, as the gateway to a radically new view 
of physical reality, why should we expect anything less? 

Happy thinking! 


The best things can’t be told: the second best are misunderstood. 
Heinrich Zimmer!” 


12.6 Problems 


1. Suppose a wave function V(x) is modified by a position-dependent 

phase, such that: U(2) —> exp(ipox/h)VU(x) = W'(x), where po is 
a constant. Show explicitly how, if at all, this will modify (p,), the 
expectation value for momentum. (It’s probably easiest to not use 
bra-ket notation.) 
Sketch some ¢(p), and a corresponding ¢’(p), to illustrate the effect of 
the above transformation on the momentum amplitude distribution. 
What physical effect do you think this transformation will have on 
the state? Discuss your results in the context of phase changes. 

2. Suppose All Bx) = Ex|Ge), where k = 1,2,3, and that |a) = 
M am|6m). (You can regard H as the Hamiltonian, though it’s 
irrelevant for this problem.) 

(i) Write out (a|H|a) in bra-ket notation; carry the calculation as far 
as possible with the given information. 

(ii) Clearly and concisely explain why your result is of the proper form 
for the expectation value of H in the state |a}. That is, why does it 
“make sense” that your equation is that of (H)? 


16 Bohr’s comment, though famous, is also mysterious—I have been unable to locate 
its source. 
17 Quoted in Campbell (1985), p. 21. 
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(iii) For an arbitrary position-space wavefunction w(a,t), the expec- 
tation value for position is: 


(laid) = if ie naa oe le: 


Carry out an analysis analogous to that in Part (iii). That is, why 
should the above integral form an expectation value for position? Iden- 
tify corresponding mathematical objects in the two expressions. Also, 
what is the justification for removing the hat from x when it appears 
inside the integral? 

. The time evolution of a free particle is given by 


co 
Dmt) = fei") (pled. (12.25) 
beT) 
This equation holds for any initial state. Your job will be to explain 
how one should interpret the above equation by comparing it to a more 
familiar case. Assume the equations in the introduction to Problem 2 
hold, and that H there is a time-independent Hamiltonian. 
(i) Write an equation that shows how to time evolve the state |a); 
carry the calculation as far as possible with the given information. 
(ii) Assume the reader is familiar with equations such as the one 
you wrote in Part (i), but not with Eq. (12.25). By comparing the 
two equations, explain why Eq. (12.25) gives the time evolved state. 
Your explanation should focus on comparing/identifying correspond- 
ing terms in the two equations; you should also explain why one 
involves a sum and the other an integral. (If you had asked for the 
explanation, could you understand your answer?) 
. Assume that the initial state of a free particle, (x), and the 
corresponding momentum-space state, ¢(p), are given by: 


1 2 2 a 2,2 2 
= —a*/2a = —p a” /2h 
x)= e ; = e 
W= 60) = fa 
(i) Calculate (x) and (a?) for this state. 
(ii) Calculate (p) and (p°) for this state. 
(iii) Show that AxAp > Ë is satisfied as an equality for this state. 


Note: This problem involves Gaussian integrals. Two of them are triv- 
ial, if symmetries in the integrand and limits are exploited. The other 
two should be looked up (discussions of Gaussian integrals may be 
found in many books). 

. Suppose we would like to investigate the time development of the 
following free-particle initial state: 


P(x) = Ne'?0*/*, for a<|a| <(L+a), 
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and W(x) = 0, elsewhere. (Take N to be a positive, real normaliza- 
tion constant.) Such a task would ultimately require solution through 
numerical (computer) methods. Nevertheless, we would want to carry 
out as much of the calculation as possible using analytical methods. 
So, do the following. 

(i) Sketch R(x), the modulus of U(x). 

(ii) Construct the properly normalized initial state (i.e. determine N). 
(iii) Construct ®(p), the initial momentum-space wavefunction. Sim- 
plify your result. 

(iv) Write down the explicit form of ®(p,t), the time-dependent 
momentum-space wavefunction, as an integral. Simplify, but don’t 
evaluate, the integral. 


Appendix A 
Mathematical Concepts 


This appendix is intended to provide a very brief review and summary—not 
a systematic development—of standard mathematical topics used in this 
book. 


A.1 Complex Numbers and Functions 


Even basic quantum mechanics demands some familiarity with complex 
numbers and functions. The starting point for complex analysis is the def- 
inition of the number i (also called j): i = —1. From this definition we 
have: i? = —1; moreover, —i? = —(ii) = 1. It’s a fascinating fact that a 
great deal of useful mathematics springs from i, a quantity that, on its face, 
looks like patent nonsense. 

The number i is the simplest example of an imaginary number, and 
multiplication of i by any “ordinary”, or real, number, is also an imaginary 
number. For example, 3i, —127, and ri are all imaginary numbers. 

A complex number is comprised of the sum of a real number and an 
imaginary number, such as —7+ 47, or 3— Ti. Clearly, any complex number 
may be written as 4+iB, where A and B are real numbers. Addition, sub- 
traction, multiplication, and division are the same for real and for complex 
numbers—for complex numbers, of course, we must be careful to properly 
manipulate i.1 

By definition, the complex conjugate (or just the conjugate) of any num- 
ber, is simply the original number, but with ¿ multiplied by —1 at each 
occurrence. If C is the complex number C = A+ iB, then its complex 
conjugate, denoted C*, is C* = A—iB. The complex square of C, denoted 
|C|? (not C2), indicates multiplication of C by C*: |C]? = C*C = CC* = 
A? + B?. 

A real function, such as f(x) = sin(x), may be thought of as a collection 
of real numbers, in the sense that a real number corresponds to each value 
of the independent variable x. Some functions associate a complex number 
with each value of an independent variable, and so may be thought of as a 
collection of complex numbers. For example: g(x) = z? +icos(x) associates 
a complex number with each x value. 


1 In this regard, note that +4 = Fi, which can be seen by multiplying through by i. 
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The complex conjugate of a function is obtained as for complex num- 
bers: the sign of i is changed everywhere it occurs; thus, g*(x) = xz? — 
icos(z). Similarly, the complex square of a function is obtained as it is 
for a number. For example: |g(x)|? = g(x)g*(x) = xt + cos?(x). In fact, 
basic operations on functions go through as for real-valued functions. For 


example, if h(x) = e” — ix?, then, 


g(x)h* (x) = 2*e* + izt + ie” cos(x) — z? cos(x) 


= x*[e* —cos(x)| +i [xt + e*cos(x)]. (A-1) 


A crucial difference remains, however, between complex numbers and 
functions and their real counterparts. An equation involving only real num- 
bers and/or functions is just that: an equation. But an equation involving 
complex quantities may, and often should, be thought of as two equations. 
The reason is that each “piece”, real and imaginary, must be independently 
satisfied. As a simple example, if a, 3, y, and 6 are all real numbers, and 
if a+ iß = y+ iô, then we must have a= y and 8 = ô. 

In this section I’ve introduced only the bare beginnings of complex 
analysis. A more in-depth discussion appears in Chapter 10. Still greater 
depth may be found in almost any mathematical physics text. 


A.2 Differentiation 


Suppose f is a function of x, that is, f = f(x). The derivative of f with 
respect to x is written df /dx. Differentiating f (or “taking the derivative” 
of f) with respect to x yields the rate at which f changes as x is varied. 
The derivative df /dx is, in general, another function of x. The expression 
(df /dx)|z+—7, denotes the value of this function when evaluated at x = xo. 

Graphically, df /dx is the slope of the graph of f(a) vs. x. Figure A.1 
(a) shows such a graph, and the slope at x = xo, that is, (df/dx)|c=2,. 


(a) (b) 
f(x) 4 


> > 
Xo x x 


Fig. A.1 (a) A function f(x), and its derivative (slope), evaluated at x = zo. 
(b) Sketch of the function z (solid curve) and its derivative, x (dashed line). 


Differentiation 177 


These concepts become more clear if we introduce some specific techniques 
of differentiation. 
Following are some basic derivatives. Take 8 to be a constant (i.e., 


BF B(a)). 


d8 d -pa dz? — g-4 
Fe ght (@) = BS F(a) ae 
d d 
x sin(x) = cos(x) — cos(x) = — sin(x) —e” = e” (A.2) 
dx dx dx 
Consider a simple function: f(x) = «7/2. Differentiating, we have: 


42° _ x These functions are plotted in Fig. A.1(b). Note how these 
plots “make sense.” Inspection of x? /2—the solid curve—reveals that as x 
increases, so does the slope of «7/2 (and the slope is always positive). This 
behavior is reflected in the plot of the slope—the dashed line in Figure 
A.1(6). It’s also reflected in df/dx = x, which (for x > 0) is always positive 
and increases with a. 

There are also some important, general rules that apply to differentia- 
tion. First, the derivative of a sum is the sum of the derivatives. Thus, for 
two functions of x, f(a) and g(x), we have 


at Oh a (A.3) 


In addition, the derivative of a product of functions, such as f(«)g(x), is 


d dg df 


dat 9 = fIr Ir (aa) 


In words, the derivative of a product is the first function times the derivative 
of the second, plus the second times the derivative of the first. 

Finally, the “chain rule” is of great importance. Suppose that h is a func- 
tion of another function, g(x): h = h(g(x)). How do we calculate dh/dx? 
The answer, in essence, is: Take derivatives (i.e. calculate rates of change) 
for each dependence, until you get to the x dependence. Then multiply 
these rates together to get the total rate of change. Thus 


dh _ ah(g(x)) dole) 


A.5 
dx dg(x) dx (ea 
For example, if h = cos(g(x)) and g(x) = x”, then 
(z2) dr2 
O a Le NO pe tay. (A.6) 


dx dx? dx 


Applications of the chain rule are widespread, and can be quite subtle. 
Now suppose a function depends on two variables x and y, e.g. G = 
G(x,y), and suppose we wish to differentiate G with respect to only one 
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of these, say y. To denote such partial differentiation, we use a partial 
derivative: OG(«,y)/Oy, meaning “take the derivative of G with respect to 
x, treating y as a constant.” 

For example, suppose G = 3xy? + cos(x) + sin? (xy). Then, 


ac = 6ry + 0 + 2sin(xy) cos(xy)a. (A.7) 
y 


Note that differentiation of the last term in G required careful application 
of the chain rule, 


ð sin? (xy) O sin? (zy) sin(xy) Ory 
Oy ~ Osin(xy) Oxy Oy 


= 2sin(xy)cos(ry)x. (A.8) 


Differentiation—whether ordinary or partial—is usually not terribly 
difficult. The study of ordinary or partial differential equations, however, 
constitute major and challenging fields of mathematics. 


A.3 Integration 


The other basic operation in calculus (besides differentiation) is integra- 
tion. The indefinite integral (with respect to x) of a function a(x), written 
J a(x)dzx, is, in general, another function of x. Integrating a(x) (or “taking 
the integral” of a(x)) “adds up” the area under a graph of a(x). 

An integral that includes limits is called a definite integral. Suppose 
that f a(x)dx = A(x); then the correponding definite integral is evaluated 
thus? 


/ OG AGH | PRGA shaw, (A.9) 


This integral yields the area under the graph of a(x) between the limits x, 
and x2. Evaluation of a derivative at a specific point yields a number (not 
a function). So too, evaluation of an integral over a specific interval yields 
a number—the area under the graph within that interval—not a function. 

Figure A.2 illustrates integration graphically. The sum of the rectangles’ 
areas approximates the area under the graph. To improve the approxima- 
tion, we can increase the number of rectangles, and decrease their widths 
accordingly. If this process is continued until each rectangle is infinitesi- 
mally narrow (a mathematical abstraction), we obtain the integral, and 
the result is exact. 

To clarify the meaning of integration, suppose a refinery receives oil 
both by railcar and pipeline. The volume of, say, the kth railcar, Vz, is 
known. But for the pipeline, what’s known is the time-dependent rate of 


2 A(x) is not to be confused with the real number A in Section A.1. 
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f(x) + 


> 
x 


Fig. A.2 The integral of f(x) is the area under the plot of f(x). The sum 
of the rectangles’ areas approximates the exact area (i.e. the integral). 


delivery, R(t). Then Viotat, the total oil delivered to the refinery from time 
ti to time tf, is 


ty 
Viota = Vj + 1 R(t)dt. (A.10) 
J 


i 


That is, we sum the discrete Vs for all railcars delivered from t; to tf, 
and integrate the continuous rate, R(t), from t; to ty. In this sense, inte- 
gration is analogous to summation, but integration applies to continuous 
distributions. 

Finally, it’s important to recognize that integration and differentiation 
are inverses of each other (and for this reason, integrals are sometimes 
called anti-derivatives). For example, if, as above, f a(x)dx = A(x), then 
(d/dx) A(x) = a(x). This implies, also, that adding a constant ( to the 
solution of an indefinite integral is also a solution, because #4 A(x) = 
ds [A(x) + 8]. 

In practice, integration is notoriously challenging, and it’s easy to con- 
jure up integrals that are difficult or impossible to solve analytically. Still, 
integrals often can be solved. One simple and general rule is that the inte- 
gral of a sum is the sum of the integrals. Here, in addition, are a few simple 
integrals. 


[ sae = 90 [otwd=o [tear fora 


[snloax = — cos(x) [costa)de = sin(x) pew = e” 
(A.11) 


3 The integral of a product, however, is not the product of the integrals: 
S f(z)g(x)dx # f f(x)dx f g(x)dx. If it were, integration would be immeasurably easier! 
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Take a few minutes to compare Eqs. (A.11) and (A.2). In particular, note 
how corresponding derivatives and integrals are indeed inverses of each 
other. 

For more challenging integrals—and they are legion—refer to integra- 
tion techniques in applied mathematics or mathematical physics texts. 
Next, try tables of integrals. If that doesn’t work, numerical (computer) 
integration may be required. Good luck. 


A.4 Differential Equations 


I will not even attempt to present methods of solution of either ordinary or 
partial differential equations in this section. Rather, our goal is to review 
what differential equations, and their solutions, are. 

A differential equation is an equation that involves derivatives of an 
unknown function. Solving a differential equation, then, is not simply a 
matter of differentiating a known function (if that were the case, solving 
differential equations would be easy). Rather, it consists of determining the 
unknown function which satisfies the differential equation. 

A simple example is Newton’s second law for one-dimensional motion, 
with constant mass: F = ma. This equation describes the trajectory, x(t), 
of a classical object. It’s deceptively simple, for two reasons. First, F may be 
a function of space and/or time: F = F (x,t). And second, the acceleration, 
a, is actually the second derivative (in time) of the position, x(t). Thus, 
with a bit of simple algebra, F = ma becomes 


i? 1 
u(t) - F (e,t) =0. (A.12) 


Solving Eq. (A.12)—which requires that F (x,t) be specified—consists of 
determining the explicit form of the unknown function x(t). 

Although solving differential equations is often very challenging, check- 
ing a solution is usually easy. The solution is simply substituted back 
into the differential equation, derivatives are taken, and if the differential 
equation is satisfied, the solution is valid. 

It’s important to realize that, unlike algebraic equations, a differential 
equation typically has an infinite number of valid solutions. To see this, 
consider the trivially simple case of Newton’s second law (Eq. (A.12)) with 
F = 0. Then any function of the form x(t) = a+ bt, with a and b constants, 
satisfies the differential equation. 

Of all valid solutions, however, only one describes a given physical sit- 
uation. Suppose, for example, that at time t = 0 the object described by 
x(t) = a+ bt is at x = 3 with speed v = 2 in the +z direction. The partic- 
ular solution that describes this physical situation is z(t) = 3+ 2t, that is, 
a = 3 and b = 2 for this case. 
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A differential equation that involves only ordinary (non-partial) deriva- 
tives, such as Eq. (A.12), is an ordinary differential equation, or ODE; if 
it involves partial derivatives, it’s a partial differential equation, or PDE. 

The study of PDEs arguably constitutes the central subject in applied 
mathematics. Rather than superficially discuss some of the many—often 
quite sophisticated—methods of attack that exist for PDEs, Pll simply 
make a few broad remarks. 

Because partial derivatives involve differentiating with respect to one 
variable while holding all others constant, PDEs evidently involve multi- 
variable functions. For the ordinary differential equation of Eq. (A.12), 
the choice of two numbers, our inital conditions, selected one particular 
solution, corresponding to the physical situation of interest, out of the 
infinite set of possible solutions. 

A PDE, also, possesses an infinite set of solutions. But selecting out 
a particular one of these, corresponding to a particular physical situation, 
requires specification not simply of numbers, but of an entire function. 

The following simple PDE is the traveling wave equation: 


= f(x, t) = v= f (x,t). (A.13) 


Equation (A.13) describes waves on a string, or electromagnetic waves in 
vacuum; such waves do not change shape as they travel. Here v is the wave’s 
speed, and f(a,t) is its amplitude (e.g. for a wave on a string, f(x, ¢) is the 
lateral displacement of the wave at position x and time t). 

It can be shown (if one is careful with the chain rule!) that any func- 
tion of the form f(x—vt) satisfies Eq. (A.13).4 Clearly, there are an infinite 
number of such functions. The correct one in any particular case is deter- 
mined by the initial form of the wave—the “initial condition” is an entire 
function. 

Another form of wave equation is the time-dependent Schrödinger 
equation: 

or, cca HWV(c,t). (A.14) 
Ot 
Equation (A.14) is central to quantum mechanics. In practice, É involves 
partial derivatives of x, so Eq. (A.14) is in general a highly non-trivial 
PDE— in fact, because there’s an infinite variety of possible Hs, Eq. (A.14) 
represents an infinite variety of PDEs. 

Unlike Eq. (A.13), solutions to Eq. (A.14) generally do not maintain 
their shape as they propagate (see Fig. A.3). Like Eq. (A.13), however, 
selection of a particular solution from the infinite set of possibilities requires 
specification not simply of a number, but of the initial function. 


4 To clarify: f(a, t) denotes some as-yet unknown function of x and t; f (x—vt) denotes 
any function of x and t with argument x — vt, such as exp [- (a — vt)?]. 
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x=0 x 


Fig. A.3 Waves satisfying Eq. (A.13) (solid curve) do not change shape 
from, say, t = 0 to t = T; those satisfying Eq. (A.14) (dotted curve) typically 
do. 


Again, this discussion of differential equations was intended only to illus- 
trate what they are, not to develop methods of solution. For that, consult 
mathematical physics texts, or treatises devoted to the subject. 


Appendix B 
Quantum Measurement 


The so-called “measurement problem” is probably the single most dis- 
cussed and debated issue in the foundations of quantum mechanics.! Our 
goal, however, is not to tackle the measurement problem, fascinating and 
profound though it is, but to understand some practical aspects and impli- 
cations of quantum measurements, within the framework of the quantum 
postulates. My focus will be on imparting a clearer conceptual picture of 
quantum measurement, leaving technical details aside. 

In Postulate 2 (of Chapter 2), we introduced the eigenvalue equation 
AW, = azVx. Postulate 3, then, was the following. 


If a measurement of the observable corresponding to the operator 
A is made on the normalized quantum state w, given by, 


b= So ntn, (B.1) 


where the W,,’s are eigenstates of A and the cn’s are expan- 
sion coefficients, then aj, the eigenvalue of Y}, will be obtained 
with probability le; |?. The system will be left in the state Y; 
immediately after the measurement. 


There are three questions we wish to address regarding Postulate 3. 


1. How does one actually “make a measurement” in quantum mechanics? 

2. Of all possible eigenvalues, what determines which one is obtained in 
any given measurement? 

3. What are the practical implications of leaving the system in the 
eigenstate corresponding to the eigenvalue that was obtained upon 
measurement? 


A quantum measurement may be thought of as consisting of two dis- 
tinct but related processes. First, a correlation is established between the 


1 A vast literature exists on the measurement problem. Some useful, if rather 
advanced, references are: Ballentine (1998); and Wheeler (1983). Note that Wheeler 
(1983) contains (pp. 152-167) a translation of Erwin Schrédinger’s paper in which 
he introduces the famous “Schrddinger’s cat”. There are, in addition, countless 
popularizations dealing with the measurement problem. 
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system to be measured and a measuring device. That is, some physical 
interaction occurs that couples the state of the micro-system to the state 
of the macro-system. In particular, the various eigenstates of the observ- 
able to be measured are correlated with different states of the measuring 
device.” 

These various states of the composite system—eigenstates of the observ- 
able and states of the measuring device, coupled together—are themselves 
coupled to increasingly macroscopic features of the device. At some point, 
presumably when the interactions approach the macroscopic level, the 
second process occurs: a particular one of the various possibilities, cor- 
responding to one of the eigenstates, becomes physically manifested in 
the device. The other possibilities—and the other eigenstates—then simply 
cease to exist, both in the measuring device and in the quantum system. 
This process is often called collapse of the quantum state.? The “argu- 
ment,” in essence, is that this must be the case, because we do not observe 
superpositions of possibilities in the macro-world. 

Analysis of the first part of this process, establishing correlations, may 
be a fearsome mathematical task in any given case.4 But even if were 
not clever enough to figure it out mathematically, there’s no problem of 
principle in imagining some physical interaction that leads to the required 
correlation between the eigenstates of some observable and measuring 
device states. (The magnetic field in the Stern—Gerlach device introduced 
in Chapter 8 served just this role.) 

The deep foundational problem stems from the second process. That is, 
what determines which particular eigenvalue we obtain upon measurement? 
This largely is the quantum measurement problem. There still is no gener- 
ally agreed upon, satisfactory resolution of this problem (indeed, there’s not 
even universal agreement that it is a problem). Quantum mechanics does 
gives us the probability distribution for the different possible measurement 
results—but that’s all it gives us.° 

Evidently questions 1 and 2 are intertwined. A measurement consists of 
the establishment of correlations, followed by state collapse. This collapse 
process, however, offers no explanation of why we obtain one eigenvalue 
rather than another: our second question must remain unanswered. 


2 A discussion of such combined, or “composite,” quantum systems is beyond the 
scope of this book. 

3 This process is also called collapse of the wavefunction, reduction of the state 
vector, etc. 

4 Bohm discusses this correlation process in: Bohm (1951), Chapter 22. 

5 The problem isn’t simply that collapse occurs, but how it does so. Collapse occurs by 
a process fundamentally different from “normal” quantum time evolution. The question, 
then, is, What, physically, justifies this process overriding normal time evolution? See 
in particular Bell (1987), p. 117. 
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From a calculational standpoint, it is question 3, regarding the practical 
implications of leaving the system in the eigenstate corresponding to the 
eigenvalue obtained upon measurement, that is of most interest to us. 

The answer is really quite simple. Suppose that at time tı our system 
is in the state Y of Eq. (B.1). We perform a measurement of A and obtain, 
say, a3. Then immediately after the measurement the system state is Ws 
(see Postulate 3, above). If we want to then time evolve the system from 
time tı, we must start in the state Y3. This is a straightforward application 
of Postulate 3, yet it’s often overlooked by students. 

Similarly, suppose that Y3 is a stationary (time-independent) state (see 
Chapter 11), and that a second measurement is to be made at some time 
t2 > tı. That measurement will be made on the system in Y3, not in the 
state p, time evolved until t2. Again, this is simple, but easily overlooked. 

Perhaps part of the difficulty—part of the reason students overlook 
state collapse—is the way it’s manifested in calculations. All terms in a 
superposition that do not correspond to the actual measurement result 
obtained are simply erased! This just seems wrong. We generally demand 
some physical or mathematical justification for “throwing out” part of an 
equation—but not in this case. The postulate is our only justification, and 
erasure is precisely what’s called for. 


Appendix C 
The Harmonic Oscillator 


The simple harmonic oscillator (SHO) may be the single most important 
system in physics. The SHO describes vibrations in classical mechanics, 
electrical oscillations in an LC circuit, quantum-mechanical vibrations of 
molecules, and countless other physical systems. Even in quantum electro- 
dynamics—the quantum field theory of the electromagnetic field—the SHO 
plays a central role, providing a model for photons, the quanta of the field. 

The SHO is also one of the few systems for which the time-independent 
Schrödinger equation (TISE) is exactly soluble. In this appendix, I’ll briefly 
discuss what the SHO is, the SHO solutions to the TISE, and a quite 
abstract representation of the SHO: the “number representation,” which 
forms the basis for the SHO’s entry into quantum field theory. 


C.1 Energy Eigenstates and Eigenvalues 


Classically, the SHO is a system subject to a “restoring force” that is linear 
in displacement from the system’s equilibrium point. If we set x = 0 at 
that point, then the force is given by: F = —kz, where k is a positive 
constant. (Equivalently, the potential energy is V = ska.) The usual 
physical example of a classical SHO is a mass, m, attached to a spring. 
Such a system oscillates with a characteristic frequency, denoted w. The 
parameters m, w, and k are related by: w = \/k/m. 

In quantum mechanics, the SHO can be solved through “standard” 
means: we form the classical Hamiltonian, and then quantize it, rewriting 
it in terms of operators. The TISE can be solved exactly for this system, 
but we’ll leave the (non-trivial!) technical aspects aside, focusing instead 
on results. 

The SHO solutions of the TISE are analogous to the energy eigenstates 
of the infinite potential well (see Chapter 9). That is, they are energy 
eigenstates, in position representation—wavefunctions corresponding to a 
specific energy.! 


1 Detailed discussions of the quantum-mechanical SHO appear in nearly every stan- 
dard quantum mechanics text. Many, such as Griffiths (2005), Liboff (2003), Schiff 
(1968), and Townsend (1992), include plots of the first few energy eigenfunctions. 
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For the SHO, the wavefunction boundary conditions are applied at the 
point where the energy associated with a particular eigenstate equals the 
oscillator’s potential energy. At this point the wavefunction changes from 
oscillatory to monotonically decreasing—somewhat like what occurs in the 
step potential and the rectangular barrier (in Chapter 9) for the E < Vo 
case. 

The SHO is a remarkable system in part because of its energy 
eigenvalues 


En = hw (n+ 4), n =0,1,2,... (C.1) 


Note first that the eigenvalues are uniformly spaced—each separated by hw 
from the next. This is a special property, with special implications. 

Let’s write the nth SHO energy eigenstate as |n). Suppose we expand 
an initial SHO state |a) in these eigenstates: |a) = )),cj|j). Chapter 11 
then implies that the time evolved state is 


la) = D emesi) = She Bs) 


j 


= 7 [cos (wt (j + 4)) — isin (wt (+ 3) Jesl). (C2 


Equation (C.2) reveals that |a(t)) is periodic, with period T = 4r/w. But 
any SHO state can be expanded in the energy representation, so any SHO 
state is periodic with period T.? This is a remarkable property of the SHO, 
arising from its uniformly spaced energy eigenvalues. 

There’s another interesting feature of the SHO eigenvalues. Our classi- 
cal, mass-and-spring SHO could start at « = 0 with zero initial velocity. 
It would then simply remain at rest at x = 0; its energy would be 
precisely zero. 

Even for n = 0, however, the quantum oscillator’s energy is non-zero. 
This zero-point energy is a strictly quantum effect, and it’s not unique to 
the SHO. For example, the energy eigenvalues of the infinite potential well 
(see Chapter 9) are En = n?77h?/2mL?, with n = 1,2,3,...; the energy 
can never be zero. 

We can at least heuristically justify the zero-point energy by invoking 
the classical picture of a particle moving along the x axis. Classically, any 
non-zero energy corresponds to two different possible momenta: one in the 
+z direction, the other in the —z direction. The quantum-mechanical ana- 
log to this situation is an energy eigenstate corresponding to two different 


2 Actually, T is the longest possible period. If, say, co = 0, the period will be less 
than T. 
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momenta. This implies an uncertainty in momentum: Ap, > 0. This pic- 
ture fails, however, for a state with zero energy. Then there can be only 
one momentum, 0, so Ap; = 0.3 

Now consider the zero-point energy in terms of the position-momentum 
uncertainty relation: AvAp, > h/2. For a system constrained to a finite 
region of the x axis, as are the SHO and the infinite potential well, 0 < 
Ax < oo. As just argued, a non-zero energy eigenstate satisfies Ap, > 0, 
so it’s easy to see how the uncertainty relation, also, can be satisfied—that 
is, how ArAp, > 0. But a zero-energy eigenstate satisfies Ap, = 0, and 
since Az is finite, AxAp, > 0 can’t possibly be satisfied. 

By this (admittedly non-rigorous) argument, we see that we must have 
a zero-point energy. Interesting as that fact is, however, the zero-point 
energy takes on a much deeper significance in the context of quantum 
electrodynamics. 


C.2 The Number Operator and its Cousins 


As suggested above, the energy eigenstates and eigenvalues of the SHO can 
be obtained using “standard” methods. An alternative approach uses what 
are often called operator, or algebraic, methods. The key to such methods 
is the introduction of two operators, â and its adjoint ât, defined as: 


a= |™ (en 4 at [™ (z+ 
a oh (2+ 2), a OK (3 =p). (C.3) 


We leave solution of the TISE using â and ât to the standard texts. Our 
concern is the physical meaning of â, ât, and another operator, N , defined 
as Ñ = ala. 

Both â and ât are non-Hermitian operators,* and therefore do not rep- 
resent observable quantities (see Postulate 2, in Chapter 2). However, the 
SHO Hamiltonian operator H involves both ĉ and De, as do â and â't. 
It’s not too surprising, then, that H can be written in terms of @ and at, 
explicitly: 


Nile 


) = tw (N+ 3). (C.4) 


Evidently the operator âtâ (or Ñ) is Hermitian, since H is Hermitian." 
From Equations (C.1) and (C.4), we have 


fijn) = fw (ñ a 3) In) = E,|n) = hw (n + 4) |n), (C.5) 


3 Please realize that what’s crucial is that the spread in p is 0, not that p = 0 per se. 

4 This can be seen by inspection: â Æ ât, that is, â is not self-adjoint. 

5 The hw/2 term simply multiplies a state by a constant, and so cannot affect whether 
H is Hermitian. 
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where |n) is the nth energy eigenstate. Evidently, the action of N on the 
state |n) is to return the number n: N|n) = n|n). For this reason, N is 
called the number operator. 

To uncover the meaning of â and ât themselves, start with the 
commutation relation, 


where Eq. (C.3) and the commutation relation [ĉ, fs] = ih have been used. 
Now consider the commutator 


[Ñ a]|b) = (âtââ — aata) |b) = ât, a] aly) = —âlw). (C.7) 


(Here |Y} is an arbitrary state, included to clarify the calculation.) Thus, 


we can write Nâ — âN = —â, or Na=a(N-1). | 
Now let’s act on an SHO energy eigenstate with Nâ 


Na|n) = 4(N — 1)|n) = 4N|n) — a|n) = na|n) — a|n) = (n — 1)4|n). (C.8) 


Rewriting Eq. (C.8) as N[a|n)] = (n — 1)[@|n)] suggests thinking of a|n) 
as a new state. When the number operator, N, acts on G|n), it returns 
the number n — 1. Thus, 4@|n) = c_|n — 1). A similar calculation leads to 
N[at|n)] = (n + 1)[at|n)], so that @t|n) = cy|n + 1). (Here c_ and c4 are 
as-yet undetermined constants.) 

Up to the constants c} and c_, then, ât “raises” an SHO energy 
eigenstate to the next-higher energy eigenstate, and â “lowers” an energy 
eigenstate to the next-lower energy eigenstate. We thus refer to ât and â 
as raising and lowering operators, respectively.’ 

Note the similaritites between â and ât, on the one hand, and J_ and 
Ji. (of Chapter 8), on the other. All four operators are formed from the sum 
or difference of Hermitian operators, but the raising and lowering operators 
themselves are non-Hermitian. And in both cases—angular momentum, or 
SHO energy—the eigenvalues are uniformly spaced. 

Although the raising, lowering, and number operators may seem like 
curiosities, they can be quite useful in calculations. Moreover, the con- 
cepts and methods of the quantum SHO form a cornerstone of quantum 
electrodynamics, the quantum field theory of electromagnetic interactions. 


C.3 Photons as Oscillators 


In Newtonian classical mechanics, the fundamental entity is the particle. 
Only in the late 19th century did fields come to play a central role in 


6 By calculating (n|âât|n) and (nat Gln), the inner product of ât|n) with itself and 
the inner product of G|n) with itself, respectively, we find that c} = V/n+1 and 


-n u : ” ius : 1 a 
c— = yn. Thus, we could create “normalized” raising and lowering operators VS at 
1 A 
and Fê 
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classical physics, in the form of the classical electromagnetic field. At the 
same time it became clear that light is a manifestation of electromagnetism: 
an electromagnetic wave. 

In quantum mechanics, the fundamental entity is the quantum 
state—though we typically think of the state as representing particles 
(though the meaning of “particle” can be unclear in quantum mechan- 
ics). Quantum field theory—a subject well beyond the scope of this 
book—extends quantum physics to describe not just particles, but also 
fields.” 

In quantum electrodynamics, the entities that comprise the electromag- 
netic field—the field “quanta”—are called photons.® We need not delve 
deeply into the concept of a photon; we require only a simple picture of its 
role in quantum electrodynamics. 

For some physical system, let us call each allowable frequency and 
direction of electromagnetic wave a mode of the system. In quantum elec- 
trodynamics, the quantum field theory of electromagnetic interactions, each 
photon in a mode of frequency w contributes an energy hw to that mode. 
Compare this to a quantum SHO of frequency w: because its energy eigen- 
values are uniformly spaced in steps of hw, its energy can only be changed 
in “chunks” of hw. This suggests pressing the SHO formalism into service 
to represent photons. 

In quantum electrodynamics, photons are routinely created as the elec- 
tromagnetic field’s energy increases, and annihilated as the field’s energy 
decreases. Then the raising, lowering, and number operators take on phys- 
ical meaning: ât corresponds to creating a photon and â to annihilating a 
photon; as such, âf and â are called creation and annihilation operators, 
respectively, in quantum electrodynamics. For a state |k), the number oper- 
ator, âtâ, again returns k, which now denotes how many photons occupy a 
particular field mode. 

Finally, we return to the SHO’s zero-point energy. The similarities 
between the SHO and photons seem suggestive, but they need not imply 
that photons actually are oscillators. Still, the correspondence is quite deep. 

When the electromagnetic field is represented as photons, the SHO’s 
zero-point energy appears as the vacuum energy—an energy associated 
with the field even if there are no photons in any of the modes. 

Although the vacuum energy—the zero-point energy of the field—has 
very few observable consequences, it’s more than a mere curiosity. In quan- 
tum electrodynamics, the vacuum energy leads to a picture of the “vacuum” 
which is not at all empty, but full of activity, including the continual 
creation and destruction of virtual particle-antiparticle pairs. 


7 In quantum field theory, particles are often thought of as localized excitations of 
fields. 
8 A photon is not simply a “particle of light.” See Loudon (2000), pp. 1-2. 
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To venture further into the vacuum energy, and quantum electrodynam- 
ics generally, would lead us into particle physics and quantum field theory 
proper. These are subjects both wide and deep. If interested, I suggest you 
consult works that focus on these topics—but be forewarned, it can be very 
tough going! 


Appendix D 
Unitary Transformations 


In Section 5.4 I stated, without justification, the defining condition for 
a unitary operator: if U is unitary, then Ut = 0-1; ; that is, U's adjoint 
is equal to its inverse. I also pointed out that such Speen implement 
transformations of quantum-mechanical systems in space and time, and 
that such transformations are connected with conservation principles and 
symmetries. 

In Section 10.2.3 I stated, again without justification, that the general 
form of a unitary operator is U(7) = exp(iGy), with y a parameter and G 
a Hermitian operator (the time evolution operator, exp( —iHt /h), provides 
an example). 

Although much quantum mechanics can be done without further knowl- 
edge of unitary operators and transformations, a deeper acquaintance 
can be valuable. In this appendix I first discuss the unitarity condition, 
ût = Ô -1 and then introduce infinitesimal unitary transformations. From 
such a transformation, I obtain its finite counterpart, thus justifying the 
standard form U(7) = exp(iGy). The specific unitary operators of quan- 
tum mechanics are then introduced. Finally, I briefly discuss the connection 
between unitary transformations, symmetries, and conservation principles. 

This appendix comprises some of the more difficult material in this 
book. The hope is that it is nevertheless more condensed and digestible 
than many of the rigorous, comprehensive treatments found elsewhere. 


D.1 Unitary Operators 


The laws of physics are preserved under the following transformations in 
space and time:! 


e translations (e.g. along the x axis), 

e rotations (e.g. about the z axis), 

e velocity transformations (i.e., to a uniformly moving frame of refer- 
ence) ,” 

e time translations (time evolution). 


1 This statement is discussed in more detail in Section D.3. 
2 Transformations to accelerating reference frames are not included, since the laws of 
physics are not preserved under such a transformation. 
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Fig. D.1 f(x) = e77" is a Gaussian centered at « = 0; f(z- 4) = e(®=4? is 
the same Gaussian centered at x = 4. 


As an example of such a transformation, consider the function f (£) = e~*”, 


a Gaussian centered at x = 0. Then f(x—4) = e(®=4) is the same Gaussian 
translated so that it’s centered at x = 4 (see Fig. D.1). 

This picture of the translation of a function can help us determine 
the form of operator that would effect such a transformation in quantum 
mechanics. Suppose that |W) is some normalized quantum state, so that 
(|) = 1, and also that the operator U = Û(x) translates the state along 
the x axis, without otherwise altering the state. Then U|W) is the translated 
ket state, and, by definition of the adjoint of an operator (cf. Ch. 5), (¥|Ut 
is the similarly translated bra state corresponding to U|W). 

Because we have done nothing other than translate |W) and (Y|, the 
inner product of these translated states must be unaltered (invariant). 
That is, 


(wb) = (wô) (GIy) = (WOO) = (7| (ilw) (DD 


As indicated by the last expression in Eq. (D.1), if we regard Ût as acting 
to the right, the equality must still hold. But then we must have ut =0-}. 
This is precisely the condition that defines a unitary operator. 

Analogous arguments could be invoked for the other space-time trans- 
formations listed above. We conclude, therefore, that such transformations 
must be implemented by unitary operators, that is, operators satisfying 
at, 

Because the transformations listed in Section D.1 may be carried out 
to any extent—for example, a translation may be carried out through any 
distance—they are called continuous transformations.* 

Consider the special case of translation through an infinitesimal dis- 
tance. Although this is a mathematical abstraction—we could never 
physically translate anything an infinitesimal distance—it provides the 


3 To help see this, consider two real functions, a(x) and B(x), and their identically 
translated counterparts, a(x — b) and B(x — b) (with b some constant). The inner prod- 
uct (see Chapter 2) clearly cannot be altered by the translation: [°° a(x)G(«)dx = 
Son alx — b)B(a — b)da. 

An example of a non-continuous, or discrete, transformation, is reflection of a 


function about the origin, for example, e~ (* 2)? L end ea 
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starting point for finite translations. What sort of operator, denoted ĵÎ, will 
effect a translation through an infinitesimal distance 6x2? The answer is 


T(x) =1+iGrée. (D.2) 


Î is the identity operator, so iĜrôx must effect any change due to the 
infinitesimal translation. The operator Gr is called the generator of the 
translation, and the parameter dx determines how much of a translation 
is carried out—in this case, it’s infinitesimal. The 7 is introduced in antic- 
ipation of what’s to come (it could be formally eliminated by defining an 
operator G!, = iG, so that T(dx) = 1+ Ĝ! rôz). 

Now consider again the functions f(x) = e -2° and f(£x—4) = e7274), 
An inverse transformation takes us back to the original function. To be 
explicit, define g(x) = f(a—4); that is, g(x) = e7 78°, If we now translate 
g(x) by 4 in the —x direction, we have g(x + d = e{(et4)—4y? = ena” 
That is, we recover our original function. In fact, for all of the space-time 
transformations listed above, we expect that the original function can be 
recovered by an inverse transformation, similar to translating f(a) in the 
+z direction and then back again. 

This must hold for our infinitesimal translation, also. That is, we must 
have: T~!(6x)T (da) = I. This, in turn, will hold if T~!(6x) = [ —iGpéa:® 


?-1(52)P(5x) = (i = iGréz) (i m iĜròx) 
=P — iðrĜrİ + iðzİêr = f+ idx (Ĉr - Gr) =. 
(D.3) 
Now, because T is unitary, we must have T= fi, that is, 
Î-— iĜrôr = (Î + iGréa)'. (D.4) 


Note that Eq. (D.4) has not been derived; rather, it is a condition that 
must be enforced. How are we to do so? 

Consider a ket |) = cila1) + calaz), the corresponding bra (| = 
cj (ai| + ¢5(a2|, and an operator iA that satisfies iAla,) = iag|az). Then, 
defining a ket |Y’) = iAlw), we have: |W’) = iaicıļa1) + iagca|az). From 
Section 4.2.1, the bra (~’| corresponding to the ket |Y’) must be: 


(Y'| = i“aïci (a| + Ma3c3(a2| = —iaïci (a1| — ia3c3 (a2. (D.5) 
Thus, the operator that, acting to the left on (Y|, yields (w’|, is —i Ât. That 
is, (iA)' = —i At. The result of this argument is that Eq. (D.4) is indeed 
satisfied, i.e., T is unitary, if we insist that Gr = Gl that is, if Gr is 
Hermitian. 


5 In Eq. (D.3), factors of Î are dropped. The term Gr Gr (dx)? is also dropped, because 
it is of second order in the infinitesimal quantity ôx. 
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D.2 Finite Transformations and Generators 


If we carry out the infinitesimal translation through ôx repeatedly—in fact, 
approaching an infinite number of times—we obtain a translation through 
some finite distance x. 

Formally, we can rewrite ôx as: 


dx = lim —. (D.6) 


A finite translation of some function f(a) is then of the form: 


P(e) f(x) = ... (i ' iGr~) (i ! iGr—) (i+iGr=) f(x), (D.7) 


where I’ve shown only the first three of the N — oo infinitesimal 
transformations. From Eq. (D.7), then,°® 


X A N , 
Pa) = lim (i ot iGr—) = etr, (D.8) 


Unitary operators—including those that effect the space-time transfor- 
mations of Section D.1—are of the form of Î(x): an exponential whose 
argument includes i, a parameter (such as x), and a Hermitian generator 
(such as Gr). 

So far, we’ve said that Gr is the generator of translations, but what 
specific operator is Gr? We can largely answer this question independent 
of quantum mechanics. Consider translating some function F(x) in the +a 
direction by a small increment Ax (see Fig. D.2). The slope of F(x) at xo 


Fox) 
Fo) ee 


Fx) -AF(x) 


Ax 


Xo x 


Fig. D.2 Translation of a function F(x) in the +2 direction by a small incre- 

ment Ax. Here AF (x) © Ax dE | . In the limit Az — ôx, this approximation 
xo 

becomes exact. 


6 To verify Eq. (D.8), multiply out the first few terms in the limit and collect powers 
of iGrx/N. Then compare this with the Taylor expansion of the exponential. 
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is IE lro: In general, the slope is different at each x value. Within the small 


interval Ax, however, it’s approximately constant, so for the value of the 
translated function at zo we can write, 


F (ap — Ax) = F(a) — AF (ax) S F(a) — An 
T [ro 
rise ; 
= F(z) — run = F(xzo)— rise. (D.9) 


Here the terms “rise” and “run” have been introduced to aid understanding. 
In the limit Ax — ôx, F(x) is translated only infinitesimally, and Eq. 
(D.9) becomes exact: 


F(xzo — 6x) = F (xo) — da—]| . (D.10) 


Transcribing Eq. (D.10) into an infinitesimal translation operator, we have 


z r d 
T(x) = I — ôr—. (D.11) 

dx 
Equation (D.11) makes no reference to quantum mechanics. For quantum 
wavefunctions in position space, the linear momentum operator is: py, = 


ihe. Thus, in quantum mechanics, Gr = —p,/h, and T (52) becomes 
Flór) = Î — e (D.12) 


Equations (D.6) and (D.8) then imply that 
T(x) = e72, (D.13) 


which is indeed the correct finite quantum-mechanical translation operator. 

Similar arguments lead to the unitary operators for boosts and rotations 
(although the latter case is a bit subtle). The unitary operator for time 
translations is obtained from the time-dependent Schrödinger equation (see 
Chapter 11). The resulting unitary operators are 


T(x) = e` tPar/h 


Ü (t) = eith, (D.14) 


which correspond to translation by x, rotation about the z axis by 0, a 
velocity boost of v, and time translation by t.” 


7 In quantum mechanics, the operators themselves, exclusive of hi, are called the gen- 
erators. Thus, pz, not —pz/h, is the generator of translations, and similarly for 2, H, 
and Jz, the z angular momentum operator. 
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D.3 Continuous Symmetries 
D.3.1 Symmetry Transformations 


This is all quite elegant, but what is its physical significance? Although 
the crucially important time evolution operator was introduced, that’s also 
done in Chapter 11, and without recourse to such sophisticated methods. 
The physical significance of the preceding lies largely in the connection of 
unitary transformations to conserved quantities. 

This area of quantum mechanics can be quite difficult to grasp, so a 
careful exposition of terminology is warranted. In particular, what is meant 
by the terms symmetry and continuous symmetry transformation? 

If a cylinder is rotated about its axis through any angle, it looks the 
same after the rotation—its appearance is unchanged. Rotated off-axis, 
however, the cylinder looks different. We say that the cylinder’s appearance 
is invariant (unchanged), or symmetric, with respect to rotation about its 
axis, but not with respect to off-axis rotations.® 

Now consider translating the cylinder along its axis. Apart from this 
shift, the cylinder looks the same. This illustrates the importance of termi- 
nology: We cannot say whether the cylinder is invariant under translation 
along its axis until we choose a definition of invariance for this particular 
case. 

Symmetries in physics are often more abstract than that of a cylinder 
rotated about its axis, but they are united by the fact that when we say a 
physical system exhibits a symmetry, we mean that some feature of the sys- 
tem is invariant under some transformation. This seems vague, but it must 
be to encompass the many and varied transformations, and corresponding 
invariant quantities, in physics. 

A symmetry transformation, then, is simply some transformation under 
which a symmetry is exhibited, that is, under which some feature of the 
system is invariant. A continuous symmetry transformation is one for which 
the symmetry is exhibited for all values of some continuously varying 
parameter. Rotation of a cylinder about its axis (through any rotation 
angle) is one example. 


D.3.2 Symmetries of Physical Law 


Continuous symmetries in physics can be classified in a number of ways: 
geometrical or dynamical, internal or external, local or global.? For our 
purposes, however, I only wish to distinguish those symmetries that are 
inherent in Nature herself from those that are associated with a particular 
physical system. 


8 Even off-axis rotations leave the cylinder’s appearance invariant for certain rotation 
angles, such as 27. But these constitute discrete, rather than continuous, symmetries. 
9 These classifications, and other relevant aspects of continuous symmetries, are 
discussed in a lucid yet non-technical way in: Kosso (2000). 
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As pointed out at the beginning of Section D.1, the laws of physics are 
(so far as we know) invariant with respect to translations, rotations, veloc- 
ity transformations, and time translations. This statement deserves careful 
consideration: it does not mean that the numerical values of physical quan- 
tities, such as position or velocity, are preserved, but that physical laws, 
such as Newton’s laws of motion, or the Schrödinger equation, remain valid. 
Moreover, “invariance” is used here in a somewhat different sense than 
before. A symmetry of a physical system refers to the invariant nature of 
some feature of the system (e.g., the appearance of a cylinder). In contrast, 
the laws of physics are not a feature of any particular system, but of Nature 
herself. 

Invariance of the laws of classical physics is easily illustrated. A pas- 
senger on a train throws a ball directly upwards, and sees it travel straight 
up, and straight back down. To a trackside observer, however, the ball 
does not describe a simple linear path, but (because of the train’s motion) 
an arc. Yet in both cases Newton’s laws—in particular the second law, 
Fie = = T —are obeyed. 

How is the invariance of physical law reflected in quantum mechan- 
ics? One way is through the invariance of inner products, enforced by the 
unitarity condition, Ut = U-!, developed in Section D.1: 


(g|UTU|0) = (g|U~" U6) = (916). (D.15) 


Physically, this invariance of inner products reflects the fact that the pro- 
jection of one state onto another, and thus all probabilities, cannot depend 
on whether we perform an experiment in a laboratory “here”, or in a lab 
that is translated (in space or time), rotated, or uniformly moving with 
respect to it. 

Yet even classically, this isn’t how the world seems to work. For exam- 
ple, compare dropping a ball near Earth’s surface to dropping it a few 
thousand kilometers above the surface. Then the difference in observed 
behavior of the ball does not simply arise from an altered point of view. 
The ball actually behaves differently, because of the altered gravitational 
field (although in both cases, the laws of classical physics are still obeyed). 

The quantum-mechanical situation is similar. For example, if an atom 
is in an electric field, it possesses more energy levels than when it is not. 
Thus, translating an atom into or out of such a field actually changes its 
spectrum. 

The problem is again one of terminology. The invariance of physical 
law under translation refers to translation of the entire physical situa- 
tion—fields and all—in space. Thus, our reference to the invariance of 
experimental results due to translation or rotation of the laboratory implic- 
itly assumes that everything upon which the experiment’s outcome depends 
is contained within the laboratory itself. 
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The invariance of physical law provides the foundation for our use of 
unitary operators. I now turn to symmetries of particular systems. 


D.3.3 System Symmetries 


Because of the invariance of physical laws discussed in Section D.3.2, all 
translations, rotations, boosts, and time translations are continuous sym- 
metry transformations. But symmetries may also be associated with a 
particular feature of a particular system, and in that context, only those 
transformations for which that system feature is invariant are properly 
called symmetry transformations. 

Our cylinder’s appearance was invariant for on-axis rotations, but 
not for off-axis rotations. In contrast, a sphere is invariant with respect 
to rotations about any axis that passes through its center. Many other 
shapes aren’t invariant with respect to any rotations (pick up a stone 
and rotate it). In this context, whether a rotation is a continuous symme- 
try transformation depends on the particular “system” (i.e. the particular 
shape). 

In quantum mechanics, it’s H that characterizes a particular system, so 
it’s H’s properties that characterize a system’s symmetries. Recall (from 
Section 5. 4) that an operator A transforms under a unitary operator U 
as: A> A’ =U +AU. Thus, if Å is invariant under the transformation 
corresponding to U, then H must satisfy!° 


UtHU = H — [H,U] =0. (D.16) 


If Eq. (D.16) is satisfied, then the observable corresponding to the generator 
of the transformation U is a constant of the motion (see Section 11.3.2). 

Consider, for example, the unitary translation operator T (see Eq. 
(D.14)). If IAr, Î] = 0, then ÎI AP = H. Thus, H is translationally invari- 
ant along the x axis, and p,—the observable corresponding to the generator 
pe—is a constant of the motion. That is, d(p,)/dt = 0. 

This is an example of Noether’s theorem, a famous result that’s 
applicable in both classical and quantum mechanics. Noether’s theorem 
states that, corresponding to each continuous symmetry transformation 
of a system there is a conserved quantity.!4 Referring to Eq. (D.14), we 
see that 


z] = 0 — conservation of angular momentum in the z direction, 
. [H, V] = 0 — conservation of x position, i i 
3. [H, H] = 0 — conservation of energy, unless H = H(t). 


10 Jf [Ñ Û] = 0, then Öt AÙ = ÔtÛ A = 0-08 = A. 
11 Noether’s theorem in classical mechanics is discussed in: Hand (1998), pp. 170-175. 
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If, in addition to [H, R] = 0, we also have [H, R+] = 0 and [H, R,] = 0 
(where R, and Èy are rotation operators about the x and y axes, 
respectively), then total angular momentum is conserved. 

For case 2 above, note that ĉ is the generator of the transformation Ŷ, 
while p2 /2m is the kinetic energy operator, and thus part of H. But (2, Pz] 
never vanishes, so this case is of little interest. 

For case 3, [A A ] = 0 trivially. This means that the system energy is 
a constant of the motion, unless H depends explicitly on t. 

In this book, our use of unitary operators is limited, apart from the time 
evolution operator. In some areas, however, such as particle physics, con- 
tinuous symmetries and the related conservation principles form a central 
topic. 
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